MaartenGr/BERTopic

Better Topic Modeling Visualization/Saving for Image Only Analysis

Open

#2,236 创建于 2024年12月6日

在 GitHub 查看
 (7 评论) (0 反应) (0 负责人)Python (634 fork)batch import
enhancementgood first issue

仓库指标

Star
 (5,074 star)
PR 合并指标
 (30 天内没有已合并 PR)

描述

Feature request

I would like to see a swipe gallery of each topic. Each image would contain the top representative images of that topic in a grid (similar to what is shown at the end of the process described here: https://maartengr.github.io/BERTopic/getting_started/multimodal/multimodal.html#images-only) with the captions underneath. I would like the option to create the grid with the original images since the images right now are low quality in the grid. If creating a swipe gallery is too much, something that just saves the topics in a folder as 1_keyword1_keyword2_keyword3_keyword_4.jpg also works.

Motivation

Currently, at the end of the image modeling process (https://maartengr.github.io/BERTopic/getting_started/multimodal/multimodal.html#images-only), there is a grid that shows the list of keywords and the grid. This works well for exploratory analysis but is hard to display later. Right now, there is no way to display the results in a better way. In addition, even saving the grid image is inefficient because it is generated from the compressed images rather than the original.

Your contribution

I have created a save captions and grid function I could add, but I don't think it will help with the larger issue of not being able to utilize the original images to create a grid as I don't have the original dataset.

贡献者指南