piskvorky/gensim

potential Doc2Vec feature: reverse inference, to synthesize doc/summary words

Open

#2459 opened on April 21, 2019

(5 comments) (0 reactions) (0 assignees) Python (15,144 stars) (4,349 forks)
Labels: Hacktoberfest, difficulty medium, feature, good first issue, wishlist

Description

Motivated by the SO question: https://stackoverflow.com/questions/55768598/interpret-the-doc2vec-vectors-clusters-representation/55779049#55779049

Doc2Vec could plausibly offer a reverse-inference function: given a doc-vector, return a ranked list of the words the model most strongly predicts from that input vector. It would work much like Word2Vec.predict_output_word(). Such a word list might be useful as a rough summary or label for a doc-vector.
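
To make the idea concrete, here's a rough sketch of the core computation, in plain Python so it stands alone. It is not existing gensim API: `predict_doc_words`, `output_weights`, and `index_to_word` are hypothetical names, and the toy numbers are made up. It assumes a negative-sampling model whose output layer holds one weight row per vocabulary word, and it treats the doc-vector as the sole input to that layer, then softmaxes the scores and returns the top words.

```python
import math


def predict_doc_words(doc_vector, output_weights, index_to_word, topn=10):
    """Rank words by the probability the output layer assigns them for one doc-vector.

    Hypothetical helper (not part of gensim). `output_weights` is assumed to be
    a list of per-word weight rows from a negative-sampling output layer.
    """
    # One raw score per vocabulary word: dot product of the doc-vector
    # with that word's output-layer weight row.
    scores = [sum(d * w for d, w in zip(doc_vector, row)) for row in output_weights]

    # Softmax over the scores, shifted by the max for numerical stability.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Indices of the most-predicted words, highest probability first.
    ranked = sorted(range(len(probs)), key=probs.__getitem__, reverse=True)
    return [(index_to_word[i], probs[i]) for i in ranked[:topn]]


# Toy data, invented purely for illustration.
toy_vocab = ["cat", "dog", "stock", "market"]
toy_weights = [
    [1.0, 0.0],  # cat
    [0.8, 0.1],  # dog
    [0.0, 1.0],  # stock
    [0.1, 0.9],  # market
]
toy_doc_vector = [0.1, 2.0]

summary = predict_doc_words(toy_doc_vector, toy_weights, toy_vocab, topn=2)
print(summary)
```

In a real implementation the weight rows and vocabulary would come from the trained model rather than hand-written lists, and the doc-vector could be either a trained one or the result of `infer_vector()`. Whether the resulting words read as a sensible summary likely depends on the training mode and is something to check empirically.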
