Skip to content

207 Machine Learning Project using various clustering models

Notifications You must be signed in to change notification settings

phoebeyueh/207_customer_segmentation

Repository files navigation

👩‍🎓 About

I completed this project as part of the Machine Learning class at UC Berkeley alongside my peers Chloe Nguyen and Catherine Liao. I want to express my gratitude for their valuable contributions to the project!

💻 Introduction

This project focuses on conducting a comprehensive customer personality analysis for optimizing marketing strategies. Through extensive data preprocessing and clustering techniques such as KMeans and Agglomerative Clustering, we aim to identify four distinctive customer segments to empower the company with actionable insights to tailor products and targeted campaigns, thereby maximizing customer engagement and conversion rates.

🔢 Dataset

Dataset: Customer Personality Analysis from Kaggle

Source: https://www.kaggle.com/datasets/imakash3011/customer-personality-analysis

❓ Models Used

  1. Mini Batch K-Means Clustering
  2. Agglomerative Clustering
  3. DBSCAN
  4. GMM

🔑 Key Findings

All models showed similar results.

Common personas among the models:

  • Customer persona 1: Lowest income families (1-2 kids) with low spending habits across all products and all purchasing methods
  • Customer persona 2: Moderate income families (1-2 kids) with moderate spending habits and high deals, stores and web purchases
  • Customer persona 3: High income singles/ small families (0-1 kid) with high spending habits across all product categories- especially wine sales
Screenshot 2024-04-26 at 10 57 30 AM

Releases

No releases published

Packages

No packages published