Skip to content

Latest commit

 

History

History
110 lines (97 loc) · 3.86 KB

speech_classification.md

File metadata and controls

110 lines (97 loc) · 3.86 KB

Speech Classification

Zalo AI Challenge: Voice Gender Classification

Identifying gender and regional accent from speech is essential for intelligent systems such as conversational chatbot, recommendation systems, smart home, and speech recognition. In this speech challenge, you will build a system to predict genders and regional accents of Vietnamese speakers using a diverse speech dataset. The dataset consists of ~30K short speech signals recorded in an un-controlled environment.

Leaderboard

Model Score Paper/Source Code
Private Test Public Test
VietAI 0.847 0.817 Video
Slide
AIS-HelloKitty 0.832 0.786
ZE 0.821 Official
pbcquoc 0.678 Official

Zalo AI Challenge: Music Genre Classification

Music Genre classification is a difficult and interesting challenge. A good classification is very helpful for smart music storage, music recommendation, music search. Despite of their usefulness, there are not many good music classifiers yet, especially for Vietnamese songs.

In this challenge, you are to build a classifier to detect the correct genre of a Vietnamese song. The 10 selected genres are: Cải Lương, Nhạc Cách Mạng, Nhạc Dân Ca - Quê Hương, Nhạc Dance, Nhạc Không Lời, Nhạc Thiếu Nhi, Nhạc Trịnh, Nhạc Trữ Tình, Rap Việt, Rock Việt.

A training set with labels is provided for your training. A test set with no category labels is also provided to test your trained classifiers against unseen data.

For each song, the classifier will need to output the most matching genre. Teams are scored and ranked by the classification accuracy on the test set.

Leaderboard

Model Score Paper/Source Code
Private Test Public Test
DungNB 0.701 0.815 Video Official
Batip 0.681 0.810 Official
toppan 0.652 0.802 Write up Official
ZE 0.782 Official