FMInference/FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

PythonStars 9381Forks 587Watchers 9381Open issues 58License Apache License 2.0
Details
仓库信息
OwnerFMInference
Homepage
Last pushed2024-10-28
Last updated2025-12-14
Issues fetched at

Stats

Community at a glance

Loading...

Loading

--

Loading

--

Loading

--

Loading

--