Automatic Speech Recognition

VLSP 2018 Shared Task: Automatic Speech Recognition

In the ASR task, participants were asked to automatically transcribe Vietnamese audio files into spoken word sequences. The committee provided only the test set; each team developed its own training data for the acoustic and language models.

The test set consisted of 796 continuous wav files of news speech with a total duration of two hours, provided without any sentence segmentation information. The speech was recorded in a quiet environment and covered three dialects: Northern, Southern and Central, in proportions of 50%, 40% and 10%, respectively.

Leaderboard

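| Model       | WER (%) | SER (%) | Paper/Source            | Code |
|-------------|---------|---------|-------------------------|------|
| VAIS        | 6.29    | 75.50   | Do et al. VLSP'18       |      |
| Viettel-CSC | 7.40    | 75.38   | Nguyen et al. VLSP'18   |      |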
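
The leaderboard reports two scores: WER (word error rate) and SER (sentence error rate). The sketch below uses the standard definitions of both metrics to show how such figures are typically computed. It is an illustration only: the task description here does not state which scoring tool or text normalization the organizers used, and the function names `edit_distance` and `wer_and_ser` as well as the sample sentences are hypothetical.

```python
from typing import List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance (substitutions, deletions, insertions)."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer_and_ser(pairs: List[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (WER, SER) in percent for (reference, hypothesis) pairs.

    Assumes whitespace tokenization and no text normalization; the
    official scoring setup may differ.
    """
    total_errors = 0
    total_ref_words = 0
    wrong_utterances = 0
    for reference, hypothesis in pairs:
        ref_words = reference.split()
        hyp_words = hypothesis.split()
        errors = edit_distance(ref_words, hyp_words)
        total_errors += errors
        total_ref_words += len(ref_words)
        if errors > 0:
            wrong_utterances += 1
    if total_ref_words == 0:
        raise ValueError("references contain no words")
    wer = 100.0 * total_errors / total_ref_words
    ser = 100.0 * wrong_utterances / len(pairs)
    return wer, ser


# Made-up sample data, not taken from the VLSP test set.
sample_pairs = [
    ("xin chào các bạn", "xin chào các bạn"),
    ("hôm nay trời đẹp", "hôm nay trời mưa"),
]
wer, ser = wer_and_ser(sample_pairs)
print(f"WER = {wer:.2f}%, SER = {ser:.2f}%")
```

Under these definitions a single wrong word marks the whole utterance as incorrect, which is why SER can be far higher than WER when utterances are long.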

Miscellaneous

📜 Papers

💫 Libraries

  • 2021, vietai/ASR - Vietnamese end-to-end speech recognition using wav2vec 2.0

💫 Services

📁 Dataset