
Automatic Speech Recognition

Implementations of models for the automatic speech recognition (ASR) task.

  1. QuartzNet with the (BxS)xR architecture (see the block sketch below this list)

  2. DeepSpeech
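
For reference, QuartzNet stacks B distinct blocks, each repeated S times, and every block contains R sub-blocks of a 1D time-channel separable convolution followed by batch normalization and ReLU, with a pointwise residual connection around the block. The following is a minimal PyTorch sketch of that (BxS)xR layout; the class names, defaults, and channel handling are illustrative assumptions and do not reproduce this repository's exact modules.

import torch.nn as nn


class TCSConv1d(nn.Module):
    """Time-channel separable 1D convolution: a depthwise conv over time
    followed by a pointwise (1x1) conv that mixes channels."""

    def __init__(self, in_channels, out_channels, kernel_size):
        super().__init__()
        # Odd kernel sizes with this padding preserve the time dimension
        self.depthwise = nn.Conv1d(
            in_channels, in_channels, kernel_size,
            padding=kernel_size // 2, groups=in_channels,
        )
        self.pointwise = nn.Conv1d(in_channels, out_channels, kernel_size=1)

    def forward(self, x):
        return self.pointwise(self.depthwise(x))


class QuartzNetBlock(nn.Module):
    """One block: R sub-blocks of (TCSConv -> BatchNorm -> ReLU), plus a
    pointwise residual connection applied around the whole block."""

    def __init__(self, in_channels, out_channels, kernel_size, r=5):
        super().__init__()
        layers, channels = [], in_channels
        for i in range(r):
            layers += [
                TCSConv1d(channels, out_channels, kernel_size),
                nn.BatchNorm1d(out_channels),
            ]
            if i < r - 1:  # the last ReLU is applied after adding the residual
                layers.append(nn.ReLU())
            channels = out_channels
        self.sub_blocks = nn.Sequential(*layers)
        self.residual = nn.Sequential(
            nn.Conv1d(in_channels, out_channels, kernel_size=1),
            nn.BatchNorm1d(out_channels),
        )
        self.activation = nn.ReLU()

    def forward(self, x):
        return self.activation(self.sub_blocks(x) + self.residual(x))


def quartznet_body(num_features, block_channels, kernel_sizes, s=1, r=5):
    """Stack B = len(block_channels) blocks, each repeated S times: (BxS)xR."""
    blocks, in_ch = [], num_features
    for out_ch, k in zip(block_channels, kernel_sizes):
        for _ in range(s):
            blocks.append(QuartzNetBlock(in_ch, out_ch, k, r=r))
            in_ch = out_ch
    return nn.Sequential(*blocks)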

Notebook

An example notebook is available via the Open In Colab badge.

Getting Started

Clone the repository and step into it:

git clone https://github.com/khaykingleb/ASR.git
cd ASR

Install the requirements and the package:

pip install -r requirements.txt
python setup.py install

To train a model, run:

python train.py -c configs/config_name.json

To test a model, run:

python test.py \
      -c default_test_model/config.json \
      -r default_test_model/checkpoint.pth \
      -o result.json

Please note that to test the model you need to specify the dataset in test.py, for example LibrispeechDataset:

config.config["data"] = {
        "test": {
            "batch_size": args.batch_size,
            "num_workers": args.jobs,
            "datasets": [
                {
                    "type": "LibrispeechDataset",
                    "args": {
                        "part": "test-clean"
                    }
                }
            ]
        }
    }
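
To evaluate on a different split, the same structure should work with another LibriSpeech part; for example, switching to the noisier "test-other" split might look like the snippet below. Whether that part name is accepted depends on this repository's LibrispeechDataset implementation, so treat it as an assumption.

# Assumed alternative: point the test dataset at LibriSpeech's "test-other" split.
config.config["data"]["test"]["datasets"] = [
    {"type": "LibrispeechDataset", "args": {"part": "test-other"}}
]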

Data Used