AIOLI-MUSIC:

This README file contains information about the music-related resources and work done in the AIOLI.

TODO

THIS IS GOING TO HAPPEN

Initial presentation on a SIMPLE CLASSIFIER USING DIFFERENT ML FRAMEWORKS, 29th November 2017 (Joshi, Kyle, Noah, Andres, ...)
...?
Profit!

THIS COULD HAPPEN AND WOULD BE AWESOME

Music denoising using autoencoders https://github.com/spmallick/learnopencv/blob/master/DenoisingAutoencoder/Denoising-Autoencoder-using-Tensorflow.ipynb
Source separation using autoencoders
Emulate Spotify's recommender system using CNNs: https://hackernoon.com/spotifys-discover-weekly-how-machine-learning-finds-your-new-music-19a41ab76efe
Music style transfer (decomposition using wavelets, recomposition using SuperCollider: http://supercollider.github.io/, mapping using some unsupervised/clustering algorithms)

1. DATASETS

This is a selection of the most popular datasets. Most of them contain "songs" as understood in the context of For a single-label, small setup to train some music classification supervised algos, both the GTZAN dataset and the FMA-small version are suitable. The FMA has less research on it (AFAIK), but the GTZAN lacks licensing.

For a big-scale, multi-label project, the MSD is the absolute reference but also receives critique for not holding the plain audio files. The magnatagatune dataset has some research done on it and seems a good choice.

Some may have a normalized dabase system avaliable and precomputed features, but I think it is especially interesting to get the raw musical data and get through step of the pipeline from there.

FREE MUSIC ARCHIVE:

CC-inspired license
four versions (from 8K 30sec tracks on 8 balanced genres to 100K full tracks on 161 unbalanced genres)
audio as well as higher-level features
repo with supporting code in Python2

paper: https://arxiv.org/abs/1612.01840 code and downloads: https://github.com/mdeff/fma more info: https://freemusicarchive.org/api

GTZAN DATASET:

1000 audio tracks of 30 seconds each
10 genres, each 100 tracks
22050Hz mono, 16-bit .wav files
presumably no license given (but not needed for 30sec?)
much research done on it webpage and downloads: http://marsyasweb.appspot.com/download/data_sets/

Note: the web has also a speech vs. music dataset avaliable

MAGNATAGATUNE DATASET:

over 25k 30s chunks of mp3 files
multi-label features over 220 categories (water, english, upbeat, quick...)

webpage and downloads: http://mirg.city.ac.uk/codeapps/the-magnatagatune-dataset

MILLION SONG DATASET:

mid- and high-level features of a million whole songs
no audio, but open license (many of them are "commercially relevant")
lots of research done on it

CALAB DATASET:

over 10,000 songs performed by 4,597 different artists, weakly labeled from a vocabulary of over 500 tags
song-tag associations are mined from Pandora's website
specific format and length for the contents +licensing unclear, read http://calab1.ucsd.edu/~datasets/cal500/details_cal500.txt

downloads: http://calab1.ucsd.edu/~datasets/

2. AUDIO PREPROCESSING IN PYTHON 2:

NumPy is the omnipresent python library for numerical computation. It features tensors of arbitrary rank and a broad set of efficiently implemented operations on them, as well as vectorized notation. Most python-based machine learning frameworks (like TensorFlow or PyTorch) interact closely with it.
Importing and exporting wave files: import scipy.io.wavfile as pywav, pywav.read("<FILE_PATH>.wav"), pywav.write(nparr, samplerate, path)
Batch converting mp3 to wav in a directory: for i in *.mp3; do ffmpeg -i "$i" -acodec pcm_u8 -ac 1 -ar 22050 "${i%.mp3}.wav"; done
Converting a single file: ffmpeg -i <FILENAME>.mp3 -acodec pcm_u8 -ac 1 -ar 22050 <FILENAME>.wav
For .au to .wav: for i in *.au; do sox "$i" "${i%.au}.wav"; done
Delete all files ending with .au: find . -type f -name "*.au" -delete
LibRosa is a library for extracting audio features (especially time/frequency representations): https://github.com/librosa/librosa

This libraries are mostly oriented to CPU work. It is also possible to perform preprocessing and data augmentation on the GPU, but for most musical applications these are fine.

See the related files in this repo for more details.

3. TENSORFLOW, PYTORCH, TENSORBOARD: INSTALLATION (VENV) AND USAGE

Tensorboard: pip install git+https://github.com/lanpa/tensorboard-pytorch, and then from tensorboardX import FileWriter

Name		Name	Last commit message	Last commit date
Latest commit History 117 Commits
denoising_autoencoder		denoising_autoencoder
simple_music_classifier		simple_music_classifier
wave_to_wave		wave_to_wave
.gitignore		.gitignore
README.md		README.md
au2wav.sh		au2wav.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AIOLI-MUSIC:

TODO

THIS IS GOING TO HAPPEN

THIS COULD HAPPEN AND WOULD BE AWESOME

1. DATASETS

FREE MUSIC ARCHIVE:

GTZAN DATASET:

MAGNATAGATUNE DATASET:

MILLION SONG DATASET:

CALAB DATASET:

2. AUDIO PREPROCESSING IN PYTHON 2:

3. TENSORFLOW, PYTORCH, TENSORBOARD: INSTALLATION (VENV) AND USAGE

About

Releases

Packages

Contributors 5

Languages

aioli-ffm/music-projects

Folders and files

Latest commit

History

Repository files navigation

AIOLI-MUSIC:

TODO

THIS IS GOING TO HAPPEN

THIS COULD HAPPEN AND WOULD BE AWESOME

1. DATASETS

FREE MUSIC ARCHIVE:

GTZAN DATASET:

MAGNATAGATUNE DATASET:

MILLION SONG DATASET:

CALAB DATASET:

2. AUDIO PREPROCESSING IN PYTHON 2:

3. TENSORFLOW, PYTORCH, TENSORBOARD: INSTALLATION (VENV) AND USAGE

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages