Skip to content

xettrisomeman/Nepali-Music-Genre-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BI-LSTM Based Nepali Music Genre Classification

Introduction

Code for classifying music genre using Bidirectional LSTM.

Methods

  1. Applied 13 MFCCs bands and saved the results in json
  2. Created the model using 32 hidden neurons, output (4 neurons)
  3. Adam with default learning rate and CrossEntropyLoss is used.
  4. Trained the model for 50 epochs.

The data is not saved as png file because we can encounter losses of data and change in information in the process. Spectrograms cannot be represented as images.

Datasets

There is no audio dataset publicly available, so the data had to be created manually. There is about 400 samples of music , each 30 seconds of length. For searching the song youtube is used and for trimming the audio audacity (Open Source software) is used. For training 75% of data is used and for testing remaining 25% is used.

Results

Losses Comparison

Accuracy Comparison

Make Prediction

git clone -b test https://github.com/xettrisomeman/Nepali-Music-Genre-Classification
cd Nepali-Music-Classification
pip install -r requirements.txt
cd genreclassify
python predict.py --help

References

About

BI-LSTM Based Music Genre Classification

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages