Repository Issues

NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

Stars
 (13,823 stars)
Forks
 (2,445 forks)
Indexed issues
 (0 indexed issues)
open beginner issues
 (0 open beginner issues)
Latest indexed
Not indexed yet
Last GitHub push
Jun 7, 2026
License
No license data
Contributing guide
No contributing guide
Code of conduct
No code of conduct
Dominant language
Python
PR merge metrics
 (PR metrics pending)
Beginner labels
No beginner labels indexed

Issues

0 open indexed issues

No open indexed issues found for this repository.