pytorch/text

Language modelling dataset only one sample

Open

#381 ouverte le 12 sept. 2018

Voir sur GitHub
 (2 commentaires) (0 réactions) (0 assignés)Python (3 396 stars) (822 forks)batch import
datasetshelp wanted

Description

from torchtext import data
from torchtext import datasets

TEXT = data.Field(lower=True, batch_first=True)

train, valid, test = datasets.WikiText2.splits(TEXT)

print('len(train)', len(train))

This returns a length of one. It should print the length of the whole dataset. I have tried both with version 0.2.3 and 0.3 and none of them worked.

Guide contributeur