piskvorky/gensim

potential Doc2Vec feature: reverse inference, to synthesize doc/summary words

Open

#2,459 opened on Apr 21, 2019

View on GitHub
 (5 comments) (0 reactions) (0 assignees)Python (15,144 stars) (4,349 forks)batch import
Hacktoberfestdifficulty mediumfeaturegood first issuewishlist

Description

Motivated by the SO question: https://stackoverflow.com/questions/55768598/interpret-the-doc2vec-vectors-clusters-representation/55779049#55779049

Doc2Vec could plausibly have a function that's reverse-inference: take a doc-vector, return a (ranked) list of words most-predicted by that input vector. It'd work highly analogously to Word2Vec.predict_output_word(). Such a list of words might be useful as a sort-of summary or label for a doc-vector.

Contributor guide

potential Doc2Vec feature: reverse inference, to synthesize doc/summary words · piskvorky/gensim#2459 | Good First Issue