This project builds and trains a machine learning model with PySpark to predict customer churn. The dataset contains customer attributes and churn status. The data is first explored and preprocessed, and models are then built with logistic regression and a gradient boosting machine (GBM). Finally, model performance is evaluated with several metrics, and hyperparameters are tuned using CrossValidator.
mesudepolat/Churn-PySpark
About
Churn Prediction using PySpark