tenstorrent/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

PythonStars 26Forks 14Watchers 26Open issues 16License Apache License 2.0
Details
仓库信息
Ownertenstorrent
Last pushed2025-12-13
Last updated2025-12-14
Issues fetched at

Stats

Community at a glance

Loading...

Loading

--

Loading

--

Loading

--

Loading

--