https://ai.meta.com/research/publications/memory-layers-at-scale/
Memory Layers at Scale | Research - AI at Meta
Abstract: Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense feed-forward layers, providing dedicated capacity t...

Previously, to handle long contexts, ..