neuralmagic Repositories

A high-throughput and memory-efficient inference and serving engine for LLMs

Last commit: Sept. 4, 2024

(266 stars) (10 forks) (0 indexed issues) (0 open good first issues)

A high-throughput and memory-efficient inference and serving engine for LLMs

Last commit: June 4, 2026

(17 stars) (7 forks) (0 indexed issues) (0 open good first issues)