![]() Welsh and Irish Speech Processing Resources |
![]() ![]() ![]() ![]() ![]() ![]() |
Author: | Ivan A. Uemlianin |
---|---|
Contact: | i.uemlianin@bangor.ac.uk |
Copyright: | 2006, University of Wales, Bangor |
The only requirements are python, HTK and SpeechCluster. Note that registration is required for HTK, but the software is free of charge and models built with HTK can be commercialised.
pyhtk can be downloaded from here. The individual components can be downloaded from here.
A list of useful feature adds in no particular order:
Typing pyhtk.py -h will get the following usage summary:
pyhtk Usage summary
===================
pyhtk.py [options]
Option Function
=========== ========
-s Sets up for building an acoustic model
-b Sets up and builds an acoustic model
-a <hmmdir> Forced alignment using the hmm in <hmmdir>
-c1 Clean up log and model files ready for another build
-c2 Clean up everything except model files
$ mkdir <myHTKProject>
$ cp pyhtk.py <myHTKProject>
$ cd <myHTKProject>
$ mkdir wav
$ cp /path/to/wav/files wav
$ mkdir lab
$ cp /path/to/lab/files labn.b.:
- The wav files can be any sample-rate you like, as long as they're all the same.
- The lab files can be any format (at least any format supported by SpeechCluster; see the SpeechCluster docs for details, but it includes esps, TextGrid and htk-lab formats); they don't all have to be the same format.
Run pyhtk.py. These are the options:
This will set things up, but not actually build the AM. This is useful if you want to check your setup.
This will set things up and build the model.
Logfiles in html format are saved in log/. There are tables of contents, and errors are highlighted in red.
TODO: What is forced alignment?
$ cp pyhtk.py <myHTK_AM>
$ cd <myHTK_AM>
$ mkdir wav
$ cp /path/to/wav/files wav
$ mkdir lab
$ cp /path/to/lab/files labn.b.:
- The wav files can be any sample-rate you like, as long as they're all the same.
- The lab files can be any format (at least any format supported by SpeechCluster; see the SpeechCluster docs for details, but it includes esps, TextGrid and htk-lab formats); they don't all have to be the same format.
I haven't tested this on Windows, but pyhtk.py should work there as long as:
On Windows it should even be pointyclicky: double-clicking pyhtk.py should run it. If you rename it to pyhtk.pyw, you won't get the annoying Command Prompt window that pops up otherwise.