vllm-project/vllm (Python)
A high-throughput and memory-efficient inference and serving engine for LLMs
(80,034 stars) (16,816 forks) (61 indexed issues) (58 open good first issues)