unslothai/unsloth

[Feature] Draft model / speculative decoding

Open

#4,753 创建于 2026年4月1日

在 GitHub 查看
 (4 评论) (2 反应) (0 负责人)Python (64,271 star) (5,658 fork)batch import
feature requestgood first issuehelp wanted

描述

Can we have the possibility to select draft model in ui? Seems like an important feature, I wonder how fast would Qwen3.5 27b be if I used Qwen3.5 0.8b as draft model.

贡献者指南

[Feature] Draft model / speculative decoding · unslothai/unsloth#4753 | Good First Issue