Skip to content

A dataset of danish reviews scraped from trustpilot.com, to be used to train sentiment classification models.

Notifications You must be signed in to change notification settings

AlessandroGianfelici/danish_reviews_dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Danish reviews dataset

A dataset of danish reviews to train sentiment classification models.

This dataset has been collected from the internet using web scraping techniques. For further information take a look to the code:

https://github.com/AlessandroGianfelici/trustpilot_spider.git

Data

For each data point, the dataset contains the company name (hashed for privacy reasons), the title of the review, the text of the review and the number of stars (from 1 to 5).

Usage

The data are stored as a txt file with comma separated fields. For example, if you're using python you can load it with pandas:

import pandas as pd

data = pd.read_csv('raw_data.txt')

Support

ko-fi

About

A dataset of danish reviews scraped from trustpilot.com, to be used to train sentiment classification models.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published