[Tracking] Improve Multimodal CI coverage · sgl-project/sglang#8496

2025-07-29T04:39:17.000Z

### Checklist - [ ] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed. - [ ] 2. Please use English, otherwise it will be closed. ### Motivation 1. All existing Multi-modal CI operations are executed on a single GPU, without considering the scenario of Tensor Parallelism (TP). There is a requirement to introduce test cases for TP-2/4 configurations. 2. The payload for VLM CI is relatively low. Stress tests are necessary, and simultaneously, it is crucial to investigate whether there are any memory leaks. 3. At present, the majority of the VLM CI is only to vision. It is essential to expand its scope to audio. 4. Welcome to add more. ### Related resources The test is under `test/srt/test_vision_openai_server_x` Related PRs: - https://github.com/sgl-project/sglang/pull/8428 - https://github.com/sgl-project/sglang/pull/7519

(3 留言) (5 反應) (0 負責人)Python (6,216 fork)auto 404

Multi-modalcigood first issueperformance

倉庫指標

Star: (28,442 star)
PR 合併指標: (平均合併 2天 1小時) (30 天內合併 1,000 個 PR)

描述

Checklist

1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
2. Please use English, otherwise it will be closed.

Motivation

All existing Multi-modal CI operations are executed on a single GPU, without considering the scenario of Tensor Parallelism (TP). There is a requirement to introduce test cases for TP-2/4 configurations.
The payload for VLM CI is relatively low. Stress tests are necessary, and simultaneously, it is crucial to investigate whether there are any memory leaks.
At present, the majority of the VLM CI is only to vision. It is essential to expand its scope to audio.
Welcome to add more.

Related resources

The test is under test/srt/test_vision_openai_server_x

Related PRs:

貢獻者指南

研究方向: 查看 test/srt/test vision openai server x 下的現有 VLM CI 測試，識別 TP 配置、壓力測試、音訊測試方面的不足，並提出或實作測試用例。參考相關 PR #8428 和 #7519。
技術棧: python
領域: devops
議題類型: 測試
難度: 3
預計時間: 超過 1 週
活動狀態: 活躍
清晰度: 清晰
前置要求: Git
新手友善度: 65

倉庫指標

描述

Checklist

Motivation

Related resources

貢獻者指南

每天在信箱收到新鮮 Easy issues。