sgl-project/sglang

[Feature] Detailed Break down of time spend on Launching SGLang Diffusion

Open

Aperta il 20 feb 2026

Vedi su GitHub
 (10 commenti) (0 reazioni) (1 assegnatario)Python (28.442 star) (6216 fork)auto 404
good first issue

Descrizione

Checklist

Motivation

Diffusion and LLM have huge differences in compute characteristics. We want to have a detailed optimization of the launch time spent on SGLang Diffusion.

In this sense, to optimize the launch time, we should have a detailed breakdown of what is actually taking time when we launch our models. Please use Qwen-Image as an example, and try to break down the time spent. Then let's see whether we shall spend our time on optimize the launching time.

Related resources

No response

Guida contributor