bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2