NGPU-LM with ONNX model inference · NVIDIA-NeMo/NeMo#14501

(0 comments) (0 reactions) (1 assignee)Python (3,421 forks)github user discovery

ASRhelp wanted

Repository metrics

How can I perform inference with my ONNX-exported Fast-Conformer model using CTC decoding and NGPU-LM beam search?

Research direction: Look for examples of ONNX model inference with CTC decoder and NGPU LM beam search in NeMo; check documentation for NGPU LM integration with ONNX models; trace the inference pipeline to identify configuration steps.
Tech stack: python
Domain: machine learningai
Issue type: Research
Difficulty: 3
Estimated time: 1-3 hours
Activity status: Active
Clarity: Mostly clear
Prerequisites: PythonONNXCTC decoding
Newbie friendliness: 60