Repositories by neuralmagic

A high-throughput and memory-efficient inference and serving engine for LLMs

Last commit: Sep 4, 2024

(266 stars) (10 forks) (0 indexed issues) (0 open good first issues)

A high-throughput and memory-efficient inference and serving engine for LLMs

Last commit: Jun 4, 2026

(17 stars) (7 forks) (0 indexed issues) (0 open good first issues)
