pytorch/text

Vocab vectors using complete pretrained-embedding?

Open

#446 创建于 2018年10月12日

在 GitHub 查看
 (6 评论) (0 反应) (0 负责人)Python (3,396 star) (822 fork)batch import
enhancementhelp wanted

描述

I am new to pytorch and nlp. I have a question when I tried to build a model.

Since my training dataset is not so big, the size of its vocab is relatively small (around 5000). However, I want to deal with any other user input which could be out of this vocabulary.

The problem is, in the model I trained, the embedding layer's weight is based on the vectors of the field, not the whole word2vec pretrained embeddings. So I cannot modified it after the training is done.

I wondered is there any better approach to do it? Thanks in advance!

贡献者指南