NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

Python | Stars: 2999 | Forks: 576 | Watchers: 2999 | Open issues: 377 | License: Apache License 2.0

Details

Repository information

Owner: NVIDIA

Homepage: https://docs.nvidia.com/deeplearning/transformer-engine/user-guide/index.html

GitHub: https://github.com/NVIDIA/TransformerEngine

Last pushed: 2025-12-12

Last updated: 2025-12-14

Issues fetched at: —

Community at a glance