JOB-A-THON---January-2023---Regression-Problem

This is a repository to Jobathon January 2023 from Analytics Vidhya

Problem Statement VahanBima is one of the leading insurance companies in India. It provides motor vehicle insurances at best prices with 24/7 claim settlement. It offers different types of policies for both personal and commercial vehicles. It has established its brand across different regions in India. Around 90% of the businesses today use personalized services. The company wants to launch different personalized experience programs for customers of VahanBima. The personalized experience can be dedicated resources for claim settlement, different kinds of services at doorstep, etc. Inorder to do so, they would like to segment the customers into different tiers based on their customer lifetime value (CLTV). Inorder to do it, they would like to predict the customer lifetime value based on the activity and interaction of the customer with the platform. So, as a part of this challenge, your task at hand is to build a high performance and interpretable machine learning model to predict the CLTV based on the user and policy data.

•I did some EDA (check null values, check distributions, skewness, checked unique values, correlation) I found that there was skewness in two numerical columns. • Following that I cleaned data from outliers. • Then I made baselines of following models: Linear regression, SGD, Random Forest, SVC, Ridge, Lasso, ELnet, XGB and checked which model is performing best. • Then, I trained all the models in 5-fold and 10-fold Cross-Validation. • I found that XGB model is performing the best and I made hyperparametr tuning for it. • After that I experimented various type of pre-processing: StandardScaler, Log-Transfoormation, Power-Transformation, RobustScaler(). I choose • Encoding technique – encoded with .get_dummies() • I tried use Ensemble technique: Lass0+SVR+XGB or Lasso+SVR+RF+XGB but i didn’t get a good score. • My Final Model tuned XGB gave me on testing set: R2= 0.16212919350405086 and final score I got R2= 0.1537933031

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Dashboard 1.png		Dashboard 1.png
Dashboard 2.png		Dashboard 2.png
Jobathon_jan_XGB_tuned_Final (1).ipynb		Jobathon_jan_XGB_tuned_Final (1).ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JOB-A-THON---January-2023---Regression-Problem

About

Releases

Packages

Languages

License

Katerinafomkina/JOB-A-THON---January-2023---Regression-Problem

Folders and files

Latest commit

History

Repository files navigation

JOB-A-THON---January-2023---Regression-Problem

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages