Speech Data Processor

Speech Data Processor (SDP) is a toolkit to make it easy to:

Write code to process a new dataset, minimizing the amount of boilerplate code required.
Share the steps for processing a speech dataset.

SDP's philosophy is to represent processing operations as 'processor' classes, which take in a path to a NeMo-style data manifest as input (or a path to the raw data directory if you do not have a NeMo-style manifest to start with), apply some processing to it, and then save the output manifest file.

You specify which processors you want to run using a YAML config file. Many common processing operations are provided, and it is easy to add your own.

To learn more about SDP, have a look at our documentation.

Installation

SDP is officially supported for Python 3.10, but might work for other versions.

To install all required dependencies run pip install -r requirements/main.txt. You will need to install additional requirements if you want to run tests or build documentation.

Some SDP processors depend on the NeMo toolkit (ASR, NLP parts) and NeMo Text Processing. Please follow NeMo installation instructions and NeMo Text Processing installation instructions if you need to use such processors.

Contributing

We welcome community contributions! Please refer to the CONTRIBUTING.md for the process.

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
.github/workflows		.github/workflows
dataset_configs		dataset_configs
docs		docs
requirements		requirements
sdp		sdp
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
main.py		main.py
pytest.ini		pytest.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

dataset_configs

dataset_configs

docs

docs

requirements

requirements

sdp

sdp

tests

tests

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

CONTRIBUTING.md

CONTRIBUTING.md

LICENSE

LICENSE

README.md

README.md

init.py

init.py

main.py

main.py

pytest.ini

pytest.ini

Repository files navigation

Speech Data Processor

Installation

Contributing

About

Releases

Packages

Contributors 5

Languages

License

NVIDIA/NeMo-speech-data-processor

Folders and files

Latest commit

History

Repository files navigation

Speech Data Processor

Installation

Contributing

About

Resources

License

Stars

Watchers

Forks

Languages