sgl-project/sglang

[Feature] Simplify `tree_speculative_sampling_target_only`

Open

Opened on Nov 3, 2025

2 comments · 0 reactions · 0 assignees · Python · 28,442 stars · 6,216 forks
good first issue

Description

Checklist

Motivation

Some TODOs for newcomers to SGLang

  • Remove the passed-in `draft_probs` argument.
  • Add a more efficient kernel (torch/triton/cuda) for the topk=1 special case.
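
To make the topk=1 case concrete: with a single draft candidate per step, the speculation tree degenerates into a chain, so verification can be a simple left-to-right loop. The pure-Python sketch below is not SGLang's kernel and does not reproduce its signature. `verify_chain_target_only` and its parameters are hypothetical names, and it assumes the textbook speculative-sampling rule with the draft proposal treated as deterministic (one-hot). Under that assumption the acceptance test needs only the target probabilities, which is one plausible reading of why `draft_probs` could be dropped.

```python
import random
from typing import List, Optional, Sequence, Tuple


def verify_chain_target_only(
    draft_tokens: Sequence[int],
    target_probs: Sequence[Sequence[float]],
    rng: Optional[random.Random] = None,
) -> Tuple[List[int], int]:
    """Hypothetical helper: verify a single chain of draft tokens (topk=1).

    target_probs must have len(draft_tokens) + 1 rows. Row i is the target
    model's distribution at position i, given the prefix draft_tokens[:i].
    Returns (accepted draft tokens, one extra token drawn from the target).
    """
    rng = rng or random.Random()
    accepted: List[int] = []

    for i, tok in enumerate(draft_tokens):
        row = target_probs[i]
        # Assumption: the draft is one-hot, so the usual min(1, p/q) test
        # reduces to accepting with probability p_target(tok).
        if rng.random() < row[tok]:
            accepted.append(tok)
            continue

        # Rejected: resample from the target row with the rejected token
        # masked out (the residual distribution for a one-hot draft).
        residual = list(row)
        residual[tok] = 0.0
        extra = rng.choices(range(len(residual)), weights=residual, k=1)[0]
        return accepted, extra

    # Every draft token was accepted: draw one bonus token from the last row.
    last = target_probs[len(draft_tokens)]
    bonus = rng.choices(range(len(last)), weights=last, k=1)[0]
    return accepted, bonus
```

Because the loop stops at the first rejection and never branches, a dedicated kernel for this case would not need the tree-traversal indices that the general topk>1 path requires. Whether the real kernel follows exactly this acceptance rule should be checked against the existing implementation before porting.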

Related resources

No response
