bigscience-workshop/Megatron-DeepSpeed

Ongoing research on training transformer language models at scale, including BERT and GPT-2

Language: Python | Stars: 1425 | Forks: 228 | Watchers: 1425 | Open issues: 123 | License: Other
Details
Repository information
Owner: bigscience-workshop
Homepage
Last pushed: 2024-03-20
Last updated: 2025-12-13
Issues fetched at
