This repository is now deprecated in favour of [Elpis](https://github.com/CoEDL/elpis).
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition toolkit.
This pipeline relies on Python 3.6 and several open-source Python packages (listed here). It also assumes you have Kaldi, sox and task installed.
This library uses the task tool to run the more complex processes automatically. Once you've set up Kaldi Helpers, you can run the various pipeline tasks we've developed. Read the Taskfile for more information about the available tasks.