Deep Learning GPU Training System
Repositories
NVIDIA Repositories
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Style transfer, deep learning, feature transform
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.
Ongoing research training transformer models at scale
A toolkit for processing speech data and creating speech datasets
NeMo text processing for ASR and TTS
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
OpenShell is the safe, private runtime for autonomous AI agents.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
NVIDIA Fleet Intelligence Agent - Host agent for GPU telemetry collection and attestation
Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
NVIDIA Linux open GPU kernel module source
Synthesizing and manipulating 2048x1024 images with conditional GANs
Fast and accurate object detection with end-to-end GPU optimization
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.