sgl-project/sglang

[Feature] Detailed breakdown of time spent launching SGLang Diffusion

Open

#19087 opened on Feb 20, 2026

good first issue

Description

Motivation

Diffusion models and LLMs differ substantially in their compute characteristics. We want to optimize the launch time of SGLang Diffusion.

To do that, we first need a detailed breakdown of what actually takes time when we launch a model. Please use Qwen-Image as an example and break down the time spent. Then we can decide whether optimizing the launch time is worth our effort.
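
One possible way to collect such a breakdown is a small per-stage wall-clock timer. The sketch below is only an illustration: `LaunchTimer` and the stage names `load_weights` and `warmup` are hypothetical and do not come from the SGLang codebase, and `time.sleep` stands in for the real launch work.

```python
import time
from contextlib import contextmanager


class LaunchTimer:
    """Accumulates wall-clock time per named launch stage (hypothetical helper)."""

    def __init__(self):
        # Stage name -> accumulated seconds, kept in first-seen order.
        self.durations = {}

    @contextmanager
    def stage(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record the time even if the stage raises.
            elapsed = time.perf_counter() - start
            self.durations[name] = self.durations.get(name, 0.0) + elapsed

    def report(self):
        """Return one line per stage with seconds and share of the total."""
        total = sum(self.durations.values())
        lines = []
        for name, secs in self.durations.items():
            share = 100.0 * secs / total if total > 0 else 0.0
            lines.append(f"{name:<24}{secs:>10.3f}s{share:>7.1f}%")
        lines.append(f"{'total':<24}{total:>10.3f}s")
        return "\n".join(lines)


timer = LaunchTimer()
# Hypothetical stage names; time.sleep is a placeholder for real work.
with timer.stage("load_weights"):
    time.sleep(0.02)
with timer.stage("warmup"):
    time.sleep(0.01)
print(timer.report())
```

If a stage launches GPU work, the timer would likely need a device synchronization (for example `torch.cuda.synchronize()`) before the stage ends, since CUDA kernels run asynchronously and would otherwise be attributed to a later stage.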

Related resources

No response
