Repository issues
epwalsh/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
Issues
No open indexed issues found for this repository.