sgl-project/sglang

[Tracking] Improve Multimodal CI coverage

Open

#8,496 opened on Jul 29, 2025

View on GitHub
 (3 comments) (5 reactions) (0 assignees)Python (6,216 forks)auto 404
Multi-modalcigood first issueperformance

Repository metrics

Stars
 (28,442 stars)
PR merge metrics
 (Avg merge 2d 1h) (1,000 merged PRs in 30d)

Description

Checklist

Motivation

  1. All existing Multi-modal CI operations are executed on a single GPU, without considering the scenario of Tensor Parallelism (TP). There is a requirement to introduce test cases for TP-2/4 configurations.
  2. The payload for VLM CI is relatively low. Stress tests are necessary, and simultaneously, it is crucial to investigate whether there are any memory leaks.
  3. At present, the majority of the VLM CI is only to vision. It is essential to expand its scope to audio.
  4. Welcome to add more.

Related resources

The test is under test/srt/test_vision_openai_server_x

Related PRs:

Contributor guide