GitHub - phoebeyueh/207_customer_segmentation: 207 Machine Learning Project using various clustering models

👩‍🎓 About

I completed this project as part of the Machine Learning class at UC Berkeley alongside my peers Chloe Nguyen and Catherine Liao. I want to express my gratitude for their valuable contributions to the project!

💻 Introduction

This project focuses on conducting a comprehensive customer personality analysis for optimizing marketing strategies. Through extensive data preprocessing and clustering techniques such as KMeans and Agglomerative Clustering, we aim to identify four distinctive customer segments to empower the company with actionable insights to tailor products and targeted campaigns, thereby maximizing customer engagement and conversion rates.

🔢 Dataset

Dataset: Customer Personality Analysis from Kaggle

Source: https://www.kaggle.com/datasets/imakash3011/customer-personality-analysis

❓ Models Used

Mini Batch K-Means Clustering
Agglomerative Clustering
DBSCAN
GMM

🔑 Key Findings

All models showed similar results.

Common personas among the models:

Customer persona 1: Lowest income families (1-2 kids) with low spending habits across all products and all purchasing methods
Customer persona 2: Moderate income families (1-2 kids) with moderate spending habits and high deals, stores and web purchases
Customer persona 3: High income singles/ small families (0-1 kid) with high spending habits across all product categories- especially wine sales

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
README.md		README.md
customer_personality_analysis_agglomerative.ipynb		customer_personality_analysis_agglomerative.ipynb
customer_personality_analysis_dbscan.ipynb		customer_personality_analysis_dbscan.ipynb
customer_personality_analysis_gmm.ipynb		customer_personality_analysis_gmm.ipynb
customer_personality_analysis_k_means.ipynb		customer_personality_analysis_k_means.ipynb
final_slides.pdf		final_slides.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

👩‍🎓 About

💻 Introduction

🔢 Dataset

Dataset: Customer Personality Analysis from Kaggle

❓ Models Used

🔑 Key Findings

About

Releases

Packages

Languages

phoebeyueh/207_customer_segmentation

Folders and files

Latest commit

History

Repository files navigation

👩‍🎓 About

💻 Introduction

🔢 Dataset

Dataset: Customer Personality Analysis from Kaggle

❓ Models Used

🔑 Key Findings

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages