Skip to content

Nexdata-AI/302-Person-Hindi-and-English-Bilingual-Spontaneous-Monologue-smartphone-speech-dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

302-Person-Hindi-and-English-Bilingual-Spontaneous-Monologue-smartphone-speech-dataset

Description

Hindi and English Bilingual Spontaneous Monologue smartphone speech dataset, collected from dialogues based on given topics, covering generic domain. Our dataset was collected from extensive and diversify speakers(302 people in total, ages 18 to 46), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied. For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1420?source=Github

Format

16k Hz, 16 bit, wav, mono channel

Content category

Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds

Recording condition

Quiet indoor environment, without echoes, background voices, obvious noises

Recording device

Android phone, iPhone

Speaker

Total 302 contributors,45% male and 55% female. 291contributors aged 18-37, 10 contributors aged 38-45, and 1 contributor aged 46-65

Country

India(IND)

Language

Hindi,English

Licensing Information

Commercial License