facebookresearch/metaseq

Integrate LucidRain's RotaryEmbeddings

Open

#621 opened on Jan 27, 2023

View on GitHub
 (2 comments) (0 reactions) (0 assignees)Python (6,195 stars) (701 forks)batch import
enhancementgood first issue

Description

See https://github.com/lucidrains/rotary-embedding-torch/blob/main/rotary_embedding_torch/rotary_embedding_torch.py

And from PaLM paper:

We use RoPE embeddings (Su et al., 2021) rather than absolute or relative position embeddings, since RoPE embeddings have been shown to have better performance on long sequence lengths.

Contributor guide

Integrate LucidRain's RotaryEmbeddings · facebookresearch/metaseq#621 | Good First Issue