call for contributionhelp wanted
Description
Integrate it with main stream models: https://github.com/apple/ml-cross-entropy so that model with large vocab size uses much less memory
Integrate it with main stream models: https://github.com/apple/ml-cross-entropy so that model with large vocab size uses much less memory