Skip to content

Predicting customer churn using machine learning algorithms

Notifications You must be signed in to change notification settings

das-amlan/Customer-Churn-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

Customer-Churn-Prediction

This project is aimed at predicting customer churn using machine learning algorithms. The analysis uses several models, including Logistic Regression, Decision Tree Classifier, Random Forest Classifier, Support Vector Machine (SVM), and Artificial Neural Network.

Dataset

The analysis is based on a publicly available dataset from Kaggle. The dataset contains information about customers of a telecommunications company, including demographic information, services used, and whether they churned or not.

Libraries Used

  • pandas
  • NumPy
  • Matplotlib
  • seaborn
  • scikit-learn
  • TensorFlow
  • Keras

Steps

The project is divided into the following steps:

  • Importing required libraries
  • Loading the dataset
  • Exploratory Data Analysis
  • Processing Data for Modeling
    • Handling missing values
    • Removing irrelevant data
    • One-hot encoding for columns with categorical data
    • Scaling data
    • Handling imbalanced data using the SMOTE method
  • Test Train Split
  • Modeling and Evaluation

Each step is explained in detail in the notebook.

Outcome

The Random Forest Classifier achieved an impressive accuracy of 84% in our customer churn prediction model. In addition, our precision, recall, and f1 score metrics also showed better performance with the Random Forest Classifier. These results indicate that our model is effective at predicting which customers are most likely to churn, allowing us to take proactive steps to retain valuable customers and improve the overall performance of our business.