Repositórios

Repositórios de neuralmagic

A high-throughput and memory-efficient inference and serving engine for LLMs

Último commit 4 de set. de 2024

 (266 stars) (10 forks) (0 issues indexadas) (0 good first issues abertas)

A high-throughput and memory-efficient inference and serving engine for LLMs

Último commit 4 de jun. de 2026

 (17 stars) (7 forks) (0 issues indexadas) (0 good first issues abertas)