Sphinx OPS Aphasia Models

This is the main repository to build an acoustic model for sphinx based on the Open Speech Corpus Aphasia Corpus.

First execute the script download_word_recordings.py, this script will fetch all data from OPS.

Then execute the script convert_mp4_to_wav.py, to execute this script you must have FFMpeg installed and on your path.

After you need to prepare sphinx configuration data, to achieve this:

Then you can call the script configure_sphinx.py, this script will configure almost all the files required by sphinx, but to create a custom language model you need to execute generate_language_model.sh.

Make sure you have sphinxtrain installed on your pc

Now execute

sphinxtrain -t ops_aphasia setup

After this in your etc folder you will have a full structure or what you need for your project

Please check this link for further information.

Search for $CFG_HMM_TYPE and select .semi If you are on a multicore machine change $CFG_QUEUE_TYPE to Queue::POSIX and $CFG_NPART and $DEC_CFG_NPART to your machine cores

Then execute the train

sphinxtrain run

This could take some time.

To check the results

pocketsphinx_continuous -hmm model_parameters/ops_aphasia.ci_semi/ -lm etc/ops_aphasia.lm.DMP -dict etc/ops_aphasia.dic -inmic yes

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
common_filters.py		common_filters.py
configure_sphinx.py		configure_sphinx.py
convert_mp4_to_wav.py		convert_mp4_to_wav.py
corrupted_files.txt		corrupted_files.txt
download_word_recordings.py		download_word_recordings.py
generate_dict.py		generate_dict.py
generate_fileids.py		generate_fileids.py
generate_filler.py		generate_filler.py
generate_language_model.sh		generate_language_model.sh
generate_lm_transcription.py		generate_lm_transcription.py
generate_phone_set_from_dict.py		generate_phone_set_from_dict.py
generate_transcriptions.py		generate_transcriptions.py
ops_aphasia.html		ops_aphasia.html

contraslash/sphinx-ops-aphasia-models

Folders and files

Latest commit

History

Repository files navigation

Sphinx OPS Aphasia Models

About

Topics

Resources

Stars

Watchers

Forks

Languages