[Feature Request] 更灵活的视觉模型判别 · ChatGPTNextWeb/NextChat#5843 | Good First Issue

(4 comments) (1 reaction) (0 assignees)TypeScript (59,717 forks)batch import

enhancementgood first issuehelp wanted

Repository metrics

Stars: (87,992 stars)
PR merge metrics: (No merged PRs in 30d)

Description

🥰 需求描述

当前项目采用固定的关键词、排除关键词的方案进行视觉模型判别（isVisionModel），加上各模型厂商并没有采取一致的命名方案，导致模型视觉判别滞后和频繁修改，如最新的 gemini-exp-1114 也支持视觉能力了，但是当前的视觉判别不能直接适配，急需优化更灵活的视觉模型判别方法

🧐 解决方案

可能的解决方案：

允许通过环境变量给指定的模型加上视觉能力，如: VisionModel=model_1,model_2,model_3
允许前端网页配置、后台解析支持视觉能力的模型
...

📝 补充信息

No response

Contributor guide

Research direction: Examine the current isVisionModel function and propose a configuration system (e.g., environment variable or UI) to allow users to specify which models have vision capabilities.
Tech stack: typescriptreactnextjs
Domain: frontendfull stack
Issue type: Feature
Difficulty: 2
Estimated time: Half day
Activity status: Active
Clarity: Clear
Prerequisites: GitTypeScript
Newbie friendliness: 70