pytorch/text

Language modelling dataset only one sample

Open

#381 opened on Sep 12, 2018

Labels: datasets, help wanted

Description

from torchtext import data
from torchtext import datasets

TEXT = data.Field(lower=True, batch_first=True)

train, valid, test = datasets.WikiText2.splits(TEXT)

print('len(train)', len(train))

This prints a length of 1. I expected it to print the number of examples in the whole dataset. I have tried versions 0.2.3 and 0.3, and neither gives the expected result.
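For context, a possible explanation, as far as I can tell from the torchtext source of that era: the language modelling datasets appear to concatenate the whole corpus into a single example holding one long token list, so a length of 1 would be by design. The sketch below is a toy stand-in written only to illustrate that layout. `ToyExample` and `ToyLMDataset` are hypothetical names, not torchtext classes, and the two input lines are made-up sample text.

```python
# Toy illustration only: these classes are hypothetical stand-ins and are
# NOT part of torchtext. They mimic the assumed "one example per corpus" layout.


class ToyExample:
    """Holds a single token list, like one dataset example."""

    def __init__(self, text):
        self.text = text


class ToyLMDataset:
    """Joins every line of a corpus into ONE example with one long token list."""

    def __init__(self, lines):
        tokens = []
        for line in lines:
            # Lowercase and whitespace-tokenize, then mark the end of the line.
            tokens += line.lower().split() + ["<eos>"]
        # The whole corpus becomes a single example.
        self.examples = [ToyExample(tokens)]

    def __len__(self):
        # Counts examples, not tokens.
        return len(self.examples)


toy = ToyLMDataset(["The cat sat", "on the mat"])
num_examples = len(toy)
num_tokens = len(toy.examples[0].text)

print("len(toy)", num_examples)
print("number of tokens", num_tokens)
```

If the real dataset follows the same layout, the analogous check would be `len(train.examples[0].text)`, which should give the number of tokens in the training split. That is an assumption about the installed version and worth verifying against its source.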
