neuralmagic Repositories

A high-throughput and memory-efficient inference and serving engine for LLMs

Last commit Sept. 4, 2024

(266 stars) (10 forks) (0 indexed issues) (0 open good first issues)

A high-throughput and memory-efficient inference and serving engine for LLMs

Last commit June 4, 2026

(17 stars) (7 forks) (0 indexed issues) (0 open good first issues)
