facebookresearch/metaseq

Integrate LucidRain's RotaryEmbeddings

Open

#621 opened on January 27, 2023

2 comments · 0 reactions · 0 assignees · Python · 6,195 stars · 701 forks
Labels: enhancement, good first issue

Description

See https://github.com/lucidrains/rotary-embedding-torch/blob/main/rotary_embedding_torch/rotary_embedding_torch.py

And from the PaLM paper:

We use RoPE embeddings (Su et al., 2021) rather than absolute or relative position embeddings, since RoPE embeddings have been shown to have better performance on long sequence lengths.
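
For readers new to the technique, here is a minimal sketch of what a rotary embedding does, written in plain Python rather than PyTorch so that it runs without dependencies. It is not the code from the linked repository and not a proposed metaseq patch. The function names (`rotary_angles`, `apply_rope`), the pairing of adjacent features, and the base of 10000 are assumptions chosen for illustration, following the formulation in Su et al., 2021.

```python
import math


def rotary_angles(position, dim, base=10000.0):
    """Return one rotation angle per feature pair for a given token position.

    Pair i uses frequency base ** (-2 * i / dim), so early pairs rotate
    quickly and later pairs rotate slowly.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even so features can be paired")
    return [position * base ** (-2.0 * i / dim) for i in range(dim // 2)]


def apply_rope(vec, position, base=10000.0):
    """Rotate each adjacent pair (vec[2i], vec[2i+1]) by its position angle."""
    out = []
    for i, angle in enumerate(rotary_angles(position, len(vec), base)):
        x, y = vec[2 * i], vec[2 * i + 1]
        c, s = math.cos(angle), math.sin(angle)
        # Standard 2-D rotation of the pair (x, y).
        out.extend([x * c - y * s, x * s + y * c])
    return out


def dot(a, b):
    """Plain dot product, standing in for one attention score."""
    return sum(x * y for x, y in zip(a, b))


# Hypothetical query and key vectors (illustrative values only).
q = [0.3, -1.2, 0.7, 0.5]
k = [1.1, 0.4, -0.6, 0.9]

# Same relative offset (3 tokens) at two different absolute positions.
score_a = dot(apply_rope(q, 5), apply_rope(k, 2))
score_b = dot(apply_rope(q, 105), apply_rope(k, 102))

# The two scores agree up to floating-point error.
print(abs(score_a - score_b) < 1e-9)
```

The point of the check at the end is that the attention score between a rotated query and a rotated key depends only on how far apart the two tokens are, not on where they sit in the sequence. An actual integration would apply the same rotation to batched query and key tensors inside the attention module, before the dot product.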

Contributor guide
