HelloFresh Case Study

Data science case study on subscription churn rate, based on real-world HelloFresh data. The report includes an in-depth description of my work and thought process. This project has been carried out in 4 days (time allotted for the challenge).

Prerequisites

I used Conda 4.5.11 with Python 3.6.5 on a machine implementing Windows 10 64-bit to set up my PyData stack. I worked within a new environment defined and activated via terminal as in the following:
> conda create --name hellofresh python=3.6.5 numpy pandas matplotlib seaborn scikit-learn py-xgboost
> conda activate hellofresh

The data used in this case study can be found here. If you would like to reproduce my results download the data files, store them in a "data" folder and run the main script.

The main.py script executes the whole workflow and saves the output in a logfile.
> python main.py

Highlights

EDA: framed the problem and leveraged ambiguous features by thinking out of the box.
Feature-engineered a new dataset more suitable to the task at hand and applied relevant models.
My pipeline involved one-hot encoding, normalization, multicollinearity assessment, grid search, logistic regression and XGBoost with early stopping.
My best model achieved 81.5% accuracy and performed statistically significantly better than the naïve baseline on the test set (61%).

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
code		code
visualizations		visualizations
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
logfile.log		logfile.log
main.py		main.py
report.pdf		report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HelloFresh Case Study

Prerequisites

Highlights

About

Releases

Packages

Languages

License

carlocontaldi/hellofresh_case_study

Folders and files

Latest commit

History

Repository files navigation

HelloFresh Case Study

Prerequisites

Highlights

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages