Repository Issues

ApsarasX/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Stars
 (0 stars)
Forks
 (0 forks)
Indexed issues
 (0 indexed issues)
open beginner issues
 (0 open beginner issues)
Latest indexed
Jun 13, 2026
Last GitHub push
Dec 15, 2023
Contributing guide
No contributing guide
Code of conduct
No code of conduct
Dominant language
C++
PR merge metrics
 (No merged PRs in 30d)
Beginner labels
No beginner labels indexed

Issues

0 open indexed issues

No open indexed issues found for this repository.