codertimo/BERT-pytorch

Making Wikipedia Corpus

Open

#42 opened on Oct 30, 2018

(1 comment) (2 reactions) (0 assignees) Python (5,757 stars) (1,252 forks)
help wanted

Description

The goal is to build the same Wikipedia corpus that the original BERT paper used. Please share any tips for downloading and preprocessing the data. It would also be great if someone could share an already preprocessed copy via Dropbox, Google Drive, or a similar service.
