Interspeech2020-Accented-English-Speech-Recognition-Competition-Data

Description

Interspeech2,020 Accented English Speech Recognition Competition Data. The text has been proofread manually with high accuracy; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1169?source=Github

Format

16kHz, 16bit, uncompressed wav, mono channel.

Recording environments

quiet indoor environment, without echo.

Recording content (read speech)

generic category; human-machine interaction category; smart home command and control category; in-car command and control category; numbers.

Demographics

Train set: There are 440 people coming from eight different countries; Test set: There are 3,207 people coming from ten different countries.

Device

Android mobile phone, iPhone.

Language

English

Applications

speech recognition; voiceprint recognition.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
G00988S1147.txt		G00988S1147.txt
G00988S1147.wav		G00988S1147.wav
G01377S1014.txt		G01377S1014.txt
G01377S1014.wav		G01377S1014.wav
G01612S1141.txt		G01612S1141.txt
G01612S1141.wav		G01612S1141.wav
G10180S1142.txt		G10180S1142.txt
G10180S1142.wav		G10180S1142.wav
G10241S1030.txt		G10241S1030.txt
G10241S1030.wav		G10241S1030.wav
G10416S2287.txt		G10416S2287.txt
G10416S2287.wav		G10416S2287.wav
README.md		README.md

Nexdata-AI/Interspeech2020-Accented-English-Speech-Recognition-Competition-Data

Folders and files

Latest commit

History

Repository files navigation

Interspeech2020-Accented-English-Speech-Recognition-Competition-Data

Description

Format

Recording environments

Recording content (read speech)

Demographics

Device

Language

Applications

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks