310-Hours-Turkish-Scripted-Monologue-Smartphone-Speech-Dataset

Description

Turkish Scripted Monologue Smartphone Speech Dataset, collected from monologue based on given scripts. Transcribed with text content. Our dataset was collected from extensive and diversify speakers(223 people in total, from turkey), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1446?source=Github

Format

16kHz, 16bit, uncompressed wav, mono channel.

Recording condition

quiet indoor environment, low background noise, without echo;

Recording device

Android smartphone, iPhone;

Speaker

223 native speakers in total, 54% male and 46% female;

Country

Turkey(TUR);

Language(Region) Code

tr-TR;

Language

Turkish;

Features of annotation

Transcription text;

Accuracy Rate

Word Accuracy Rate (WAR) 95%;

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
G01001S0001.txt		G01001S0001.txt
G01001S0001.wav		G01001S0001.wav
G01001S0002.txt		G01001S0002.txt
G01001S0002.wav		G01001S0002.wav
G01001S0007.txt		G01001S0007.txt
G01001S0007.wav		G01001S0007.wav
G01004S0001.txt		G01004S0001.txt
G01004S0001.wav		G01004S0001.wav
G01004S0005.txt		G01004S0005.txt
G01004S0005.wav		G01004S0005.wav
G01004S0007.txt		G01004S0007.txt
G01004S0007.wav		G01004S0007.wav
README.md		README.md

Nexdata-AI/310-Hours-Turkish-Scripted-Monologue-Smartphone-Speech-Dataset

Folders and files

Latest commit

History

Repository files navigation

310-Hours-Turkish-Scripted-Monologue-Smartphone-Speech-Dataset

Description

Format

Recording condition

Recording device

Speaker

Country

Language(Region) Code

Language

Features of annotation

Accuracy Rate

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks