NVIDIA-NeMo/NeMo

NGPU-LM with ONNX model inference

Open

#14,501 opened on Aug 18, 2025

View on GitHub
 (0 comments) (0 reactions) (1 assignee)Python (3,421 forks)github user discovery
ASRhelp wanted

Repository metrics

Stars
 (17,298 stars)
PR merge metrics
 (Avg merge 12d) (49 merged PRs in 30d)

Description

How can I perform inference with my ONNX-exported Fast-Conformer model using CTC decoding and NGPU-LM beam search?

Contributor guide