This a simple RFM Analysis Using K Means Clustering On A Publicly Available Brazilian e Commerce Dataset on Kaggle
This repo is accompanied by a Medium article availabe here: https://zhijingeu.medium.com/building-a-rfm-segmentation-with-python-k-means-clustering-3a8f3c202fa5
I've avoided duplicating the data which is available from Kaggle here: https://www.kaggle.com/datasets/olistbr/brazilian-ecommerce where the relevant datasets are: olist_customers_dataset.csv; olist_orders_dataset.csv ; olist_order_items_dataset.csv ; olist_order_payments_dataset.csv ; olist_products_dataset.csv
Consider checking out my other repositories too ! :
- https://github.com/ZhijingEu/Cohort_Retention_Analysis - This is an implementation of a custom Customer Retention Analysis class with a number of helpful methods to generate customer churn insights frequently used for marketing analytics to understand the growth and change of an organisation's customer base (new vs retained vs lost)
- https://github.com/ZhijingEu/Customer_Lifetime_Values_BTYD_Modelling_PyMCMarketing - This a simple Customer Lifetime Value analysis using Buy Till You Die Modelling With PyMC Marketing library https://www.pymc-marketing.io