pytorch/text

Ignoring UNK words

Open

#355 opened on Jul 22, 2018

View on GitHub
 (7 comments) (0 reactions) (0 assignees)Python (3,396 stars) (822 forks)batch import
enhancementhelp wanted

Description

Cannot find the way to ignore UNK words when numericalising, i.e. instead by substituting them by a 0, it just ignore that word.

Is that implemented?

This is useful in classification problems, when you just want to remove 'UNK' words.

Contributor guide