[Feature] Detailed Break down of time spend on Launching SGLang Diffusion · sgl-project/sglang#19087

(10 commenti) (0 reazioni) (1 assegnatario)Python (6216 fork)auto 404

good first issue

Metriche repository

Star: (28.442 star)
Metriche merge PR: (Merge medio 2g 1h) (1000 PR mergiate in 30 g)

Descrizione

Checklist

If this is not a feature request but a general question, please start a discussion at https://github.com/sgl-project/sglang/discussions. Otherwise, it will be closed.
Please use English. Otherwise, it will be closed.

Motivation

Diffusion and LLM have huge differences in compute characteristics. We want to have a detailed optimization of the launch time spent on SGLang Diffusion.

In this sense, to optimize the launch time, we should have a detailed breakdown of what is actually taking time when we launch our models. Please use Qwen-Image as an example, and try to break down the time spent. Then let's see whether we shall spend our time on optimize the launching time.

Related resources

No response

Guida contributor

Direzione di ricerca: Analizza il tempo di avvio di SGLang Diffusion usando Qwen Image come esempio. Suddividi il tempo in componenti come caricamento del modello, inizializzazione dei pesi, compilazione e altri passaggi di inizializzazione. Identifica i colli di bottiglia e riporta i risultati.
Tech stack: python
Dominio: backendmachine learningperformance
Tipo issue: Funzionalità
Difficoltà: 3
Tempo stimato: Mezza giornata
Stato attività: Attiva
Chiarezza: Chiara
Prerequisiti: Python
Adatta ai principianti: 60