Skip to content

A crawler made with Scrapy that crawls the lyrics for all songs of a band from Vagalume given the band home page

License

Notifications You must be signed in to change notification settings

Naraujo13/lyrics_crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Lyrics Crawler

Overview

A crawler made with Scrapy that crawls the lyrics for all songs of a given band.

Currently the crawler data source is Vagalume.

Requirements

  • Python Python 3.6+
  • Works on Linux, Windows, Mac OSX, BSD
  • Scrapy

Installing and Executing

pip install scrapy
python vagalume_band_crawler.py band_home page

The results will be saved into the data folder, with one tsv with all the song names and links and a json with song names, their respective album and lyrics.

Example

The bands home page is usually just the band name with dashes - instead of spaces.

python vagalume_band_crawler.py https://www.vagalume.com.br/green-day/

About

A crawler made with Scrapy that crawls the lyrics for all songs of a band from Vagalume given the band home page

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages