ChineseSpeechRecognitionDataset

清华大学CSLT发布的语音数据集

A Free Chinese Speech Corpus Released by CSLT@Tsinghua University

http://www.openslr.org/18/

包含来自855个发声人的102600条发音（总计长达500小时）的朗读数据集

A free Chinese Mandarin corpus by Surfingtech (www.surfing.ai), containing utterances from 855 speakers, 102600 utterances;

http://www.openslr.org/38/

包含长达100小时的语音数据集（Shanghai Primewords）

Chinese Mandarin corpus released by Shanghai Primewords Co. Ltd. (www.primewords.cn), containing 100 hours of speech data.

http://www.openslr.org/47/

包含来自600个发声人的总计长达200小时的朗读数据集（商汤科技）

A Chinese Mandarin speech corpus by Beijing DataTang Technology Co., Ltd, containing 200 hours of speech data from 600 speakers. The transcription accuracy for each sentence is larger than 98%.

http://www.openslr.org/62/

包含来自1080个发声人的总计长达755小时的朗读数据集（魔算数据）

The corpus by Magic Data Technology Co., Ltd. , containing 755 hours of scripted read speech data from 1080 native speakers of the Mandarin Chinese spoken in mainland China. The sentence transcription accuracy is higher than 98%.

http://www.openslr.org/68/

中文发音人识别数据集

A Free Chinese Speaker Recognition Corpus Released by CSLT@Tsinghua University

http://www.openslr.org/82/

中文热词检测数据集

Chinese hotwords detection dataset, provided by Mobvoi CO.,LTD

http://www.openslr.org/87/

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChineseSpeechRecognitionDataset

About

Releases

Packages

duwangthefirst/ChineseSpeechRecognitionDataset

Folders and files

Latest commit

History

Repository files navigation

ChineseSpeechRecognitionDataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages