unslothai/unsloth

[Feature] Draft model / speculative decoding

Open

#4753 opened on April 1, 2026

4 comments · 2 reactions · 0 assignees · Python · 64,271 stars · 5,658 forks
Labels: feature request, good first issue, help wanted

Description

Could the UI offer an option to select a draft model for speculative decoding? This seems like an important feature. I wonder how fast Qwen3.5 27b would be with Qwen3.5 0.8b as the draft model.
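For anyone unfamiliar with the idea, here is a rough toy sketch of greedy speculative decoding. It is not Unsloth code and uses no real library API: `speculative_decode`, `Model`, and the `k` parameter are hypothetical names, and each "model" is just a function from a token prefix to its next token. The small draft model proposes `k` tokens, the large target model checks them, and the output is the same as what the target would produce alone, ideally with fewer target passes.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-in for a language model: maps a token prefix to its
# greedy next token. A real model would return logits instead.
Model = Callable[[List[int]], int]


def speculative_decode(
    target: Model, draft: Model, prompt: List[int], max_new: int, k: int = 4
) -> Tuple[List[int], int]:
    """Toy greedy speculative decoding; returns (tokens, target_passes)."""
    tokens = list(prompt)
    target_passes = 0
    while len(tokens) - len(prompt) < max_new:
        # 1. The cheap draft model proposes k tokens one at a time.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(tokens + proposal))

        # 2. The target verifies the proposal. A real engine would score all
        #    k positions in ONE batched forward pass, so we count one pass.
        target_passes += 1
        accepted: List[int] = []
        for i in range(k):
            expected = target(tokens + proposal[:i])
            accepted.append(expected)  # always keep the target's own token
            if expected != proposal[i]:
                break  # first mismatch: discard the rest of the proposal
        else:
            # Every proposed token matched, so the same pass also yields
            # one extra token from the target.
            accepted.append(target(tokens + proposal))
        tokens.extend(accepted)
    return tokens[: len(prompt) + max_new], target_passes
```

The speedup depends on how often the draft agrees with the target, which is why a small model from the same family seems like a natural draft candidate. Real implementations with sampling use a probabilistic accept/reject rule rather than the exact-match check above.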

