sgl-project/sglang
View on GitHub[Feature] Profile the update weights from disk API of SGLang Diffusion
Open
#18,979 opened on Feb 18, 2026
good first issue
Description
Checklist
- If this is not a feature request but a general question, please start a discussion at https://github.com/sgl-project/sglang/discussions. Otherwise, it will be closed.
- Please use English. Otherwise, it will be closed.
Motivation
As we did in this comment:
https://github.com/sgl-project/sglang/pull/18306/#issuecomment-3898841774
We should profile the actual time breakdown in the update weights from disk.
Ideally speaking, 7B models' update should be within 1s (no considering save to disk time) in this https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/sglang/latency-accelerate-for-weight-updates/readme.md
Related resources
No response