sgl-project/sglang

[Feature] Profile the update weights from disk API of SGLang Diffusion

Open

#18,979 opened on Feb 18, 2026

good first issue

Description

Motivation

As we did in this comment:

https://github.com/sgl-project/sglang/pull/18306/#issuecomment-3898841774

We should profile the actual time breakdown of the update-weights-from-disk path.

Ideally, updating a 7B model should take under 1 s (not counting the time to save to disk), as discussed in this write-up: https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/sglang/latency-accelerate-for-weight-updates/readme.md
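
As a starting point, here is a minimal sketch of how a per-stage time breakdown could be collected. It uses only the Python standard library and does not call any SGLang API. The `StageTimer` helper and the stage names (`read_checkpoint`, `copy_to_device`) are hypothetical placeholders, and the `time.sleep` calls stand in for the real work.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StageTimer:
    """Accumulates wall-clock time per named stage (hypothetical helper)."""

    def __init__(self):
        self.totals = defaultdict(float)

    @contextmanager
    def stage(self, name):
        # Assumption: for GPU work, the caller synchronizes the device
        # (e.g. torch.cuda.synchronize()) before entering and before leaving
        # the block, otherwise asynchronous kernels are attributed wrongly.
        start = time.perf_counter()
        try:
            yield
        finally:
            self.totals[name] += time.perf_counter() - start

    def report(self):
        """Return one line per stage, slowest first, with its share of the total."""
        total = sum(self.totals.values())
        lines = []
        for name, secs in sorted(self.totals.items(), key=lambda kv: -kv[1]):
            share = 100.0 * secs / total if total else 0.0
            lines.append(f"{name:<20s} {secs * 1e3:9.2f} ms  {share:5.1f}%")
        return "\n".join(lines)


timer = StageTimer()
with timer.stage("read_checkpoint"):  # placeholder for reading weights from disk
    time.sleep(0.02)
with timer.stage("copy_to_device"):   # placeholder for loading weights into the model
    time.sleep(0.01)
print(timer.report())
```

Wrapping each phase of the real update path in `timer.stage(...)` would show which phase dominates, which is the breakdown this issue asks for.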

Related resources

No response
