facebookresearch/metaseq

Integrate LucidRain's RotaryEmbeddings

Open

#621 opened on January 27, 2023

2 comments · 0 reactions · 0 assignees · Python · 6,195 stars · 701 forks
Labels: enhancement, good first issue

Description

See https://github.com/lucidrains/rotary-embedding-torch/blob/main/rotary_embedding_torch/rotary_embedding_torch.py

And from the PaLM paper:

We use RoPE embeddings (Su et al., 2021) rather than absolute or relative position embeddings, since RoPE embeddings have been shown to have better performance on long sequence lengths.
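For anyone picking this up, here is a minimal pure-Python sketch of the rotation RoPE applies, following the formulation in Su et al. (2021). It is an illustration only, not the lucidrains implementation and not metaseq code: the `rope` and `dot` helpers, the `base=10000.0` default, and the sample vectors are all assumptions made for the example. It shows the property that makes RoPE attractive: the query-key dot product depends only on the relative offset between positions.

```python
import math


def rope(x, position, base=10000.0):
    """Rotate consecutive pairs of x by position-dependent angles.

    Hypothetical helper for illustration: pair i = (x[2i], x[2i+1]) is
    rotated by the angle position * base**(-2i/d), where d = len(x).
    """
    d = len(x)
    if d % 2 != 0:
        raise ValueError("RoPE needs an even number of dimensions")
    out = []
    for i in range(d // 2):
        theta = position * base ** (-2.0 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        x0, x1 = x[2 * i], x[2 * i + 1]
        # Standard 2-D rotation of the pair (x0, x1) by theta.
        out.extend([x0 * c - x1 * s, x0 * s + x1 * c])
    return out


def dot(a, b):
    """Plain dot product of two equal-length lists."""
    return sum(u * v for u, v in zip(a, b))


# Arbitrary example query and key vectors (made up for this sketch).
q = [0.5, -1.0, 2.0, 0.25]
k = [1.5, 0.3, -0.7, 1.0]

# Both pairs of positions are 2 apart, so the attention scores should
# agree up to floating-point error.
score_near = dot(rope(q, 3), rope(k, 1))
score_far = dot(rope(q, 103), rope(k, 101))
```

In an actual integration the same rotation would be applied to the query and key tensors inside attention, before the score computation; where exactly that hooks into metaseq is left open here.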
