facebookresearch/metaseq

Remove Megatron dependency - move entirely to Fairscale

Open

#616 opened on Jan 24, 2023

1 comment, 0 reactions, 1 assignee. Python. 6,195 stars, 701 forks.
Labels: batch import, better-eng, good first issue

Description

Investigate whether we can remove our Megatron dependency and rely entirely on our Fairscale dependency. The model parallelism implementations appear to be identical between the two.
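One possible first step is to inventory where the codebase imports Megatron, so the scope of the migration is visible before anything is changed. The sketch below is only an illustration under stated assumptions: the helper names `find_megatron_imports` and `scan_tree` are hypothetical, it assumes the dependency is imported under the top-level package name `megatron`, and it uses only the Python standard library.

```python
import ast
from pathlib import Path


def find_megatron_imports(source: str) -> list[tuple[int, str]]:
    """Return sorted (line number, module name) pairs for absolute Megatron imports.

    Assumption: the dependency is imported as the top-level package `megatron`.
    """
    hits = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            # `import megatron.mpu as m` -> alias.name == "megatron.mpu"
            modules = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            # `from megatron import mpu` -> node.module == "megatron"
            # node.level == 0 skips relative imports such as `from .megatron import x`
            modules = [node.module]
        else:
            continue
        for module in modules:
            if module == "megatron" or module.startswith("megatron."):
                hits.append((node.lineno, module))
    return sorted(hits)


def scan_tree(root: str) -> dict[str, list[tuple[int, str]]]:
    """Map each .py file under `root` to its Megatron imports, omitting clean files."""
    report = {}
    for path in sorted(Path(root).rglob("*.py")):
        try:
            hits = find_megatron_imports(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            # Skip files that cannot be decoded or parsed rather than abort the scan.
            continue
        if hits:
            report[str(path)] = hits
    return report
```

Running `scan_tree(".")` from the repository root would list each file and line that still imports Megatron. Each entry is then a candidate to check against the corresponding Fairscale module by hand, since the claim that the two implementations are identical is still to be verified.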
