piskvorky/gensim

potential Doc2Vec feature: reverse inference, to synthesize doc/summary words

Open

#2,459 创建于 2019年4月21日

在 GitHub 查看
 (5 评论) (0 反应) (0 负责人)Python (15,144 star) (4,349 fork)batch import
Hacktoberfestdifficulty mediumfeaturegood first issuewishlist

描述

Motivated by the SO question: https://stackoverflow.com/questions/55768598/interpret-the-doc2vec-vectors-clusters-representation/55779049#55779049

Doc2Vec could plausibly have a function that's reverse-inference: take a doc-vector, return a (ranked) list of words most-predicted by that input vector. It'd work highly analogously to Word2Vec.predict_output_word(). Such a list of words might be useful as a sort-of summary or label for a doc-vector.

贡献者指南