mozilla/DeepSpeech

CC PMF importer improvements

Open

#3,450 opened on Dec 2, 2020

View on GitHub
 (2 comments) (0 reactions) (0 assignees)C++ (26,755 stars) (4,093 forks)batch import
good first bughelp wanted

Description

The current code does not work well:

  • extraction fails because Python zip does not support multiarchive, we should rely on 7z for example
  • we should expose MIN_SECS / MAX_SECS to the command-line arguments
  • document ffmpeg dependency for audio conversion
  • we use multiprocess for performing one WAV split, but maybe we should also use multiprocess for converting multiple MP3 to WAV in paralell

Contributor guide

CC PMF importer improvements · mozilla/DeepSpeech#3450 | Good First Issue