Skip to content

Nexdata-AI/120-Hours-Burmese-Conversational-Speech-Data-by-Mobile-Phone

Repository files navigation

120-Hours-Burmese-Conversational-Speech-Data-by-Mobile-Phone

Description

The 120 Hours - Burmese Conversational Speech Data involved more than 130 native speakers, developed with proper balance of gender ratio, Speakers would choose a few familiar topics out of the given list and start conversations to ensure dialogues' fluency and naturalness. The recording devices are various mobile phones. The audio format is 16kHz, 16bit, uncompressed WAV, and all the speech data was recorded in quiet indoor environments. All the speech audio was manually transcribed with text content, the start and end time of each effective sentence, and speaker identification.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1207?source=Github

Specifications

Format

16kHz 16bit, uncompressed wav, mono channel;

Environment

quiet indoor environment, without echo;

Recording content

dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;

Demographics

134 speakers totally, with 50% male and 50% female

Annotation

annotating for the transcription text, speaker identification and gender

Device

Android mobile phone, iPhone;

Language

Burmese;

Application scenarios

speech recognition; voiceprint recognition;

Accuracy rate

the word accuracy rate is not less than 97%

Licensing Information

Commercial License