kaldi-asr/kaldi

Update single job scripts like utils/data/get_utt2dur.sh, steps/compute_cmvn_stats.sh to use multiple jobs

Open

#984 opened on Aug 20, 2016

6 comments, 0 reactions, 0 assignees. Language: Shell. 15,392 stars, 5,359 forks.
Labels: enhancement, help wanted, stale

Description

In Kaldi's large-data recipes, many single-job scripts in steps/* and utils/*, which were very fast for normal-sized LVCSR tasks, now take several hours. For example, on fisher_swbd/aspire, the utils/data/get_utt2dur.sh script runs for a few hours.

We need to update these scripts to add parallelization support.
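
As a rough illustration of the kind of change meant here, the sketch below shows a generic split, run-in-parallel, and merge pattern in plain shell. It is not taken from any Kaldi script: `process_chunk` and `run_parallel` are hypothetical names, the per-line work is a placeholder, and a real patch would presumably go through the job-submission conventions the recipes already use rather than bare background jobs.

```shell
#!/bin/sh
# Sketch only: split the input, process the pieces concurrently, merge the results.
# process_chunk and run_parallel are hypothetical names, not existing Kaldi scripts.

# Placeholder for the expensive per-utterance work.
# Here it just prints "<first field> <number of fields>" for each input line.
process_chunk() {
  awk '{print $1, NF}' "$1"
}

# Usage: run_parallel <input-file> <output-file> <num-jobs>
# The body runs in a subshell so its variables do not leak into the caller.
run_parallel() (
  input=$1; output=$2; nj=$3
  tmp=$(mktemp -d) || exit 1

  # Distribute input lines round-robin over nj piece files: in.1 ... in.nj.
  awk -v nj="$nj" -v dir="$tmp" \
    '{ print > (dir "/in." ((NR - 1) % nj + 1)) }' "$input"

  # Start one background job per non-empty piece and remember its PID.
  pids=""
  j=1
  while [ "$j" -le "$nj" ]; do
    if [ -f "$tmp/in.$j" ]; then
      process_chunk "$tmp/in.$j" > "$tmp/out.$j" &
      pids="$pids $!"
    fi
    j=$((j + 1))
  done

  # Wait for every job; remember if any of them failed.
  status=0
  for pid in $pids; do
    wait "$pid" || status=1
  done

  # Merge the per-job outputs, sorted in the C locale.
  if [ "$status" -eq 0 ]; then
    if [ -n "$pids" ]; then
      cat "$tmp"/out.* | LC_ALL=C sort > "$output" || status=1
    else
      : > "$output"   # empty input gives empty output
    fi
  fi

  rm -rf "$tmp"
  exit "$status"
)
```

Calling `run_parallel utts.txt result.txt 4` (file names are illustrative) would process `utts.txt` in up to four concurrent pieces. The final sort matters because a round-robin split does not preserve the input order across pieces.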

Required skill: shell scripting
