A machine learning project that detects fraudulent transactions in an imbalanced credit card dataset.

AI-14/credit-card-fraud-detection

Credit Card Fraud Detection

Table of contents

  1. Description
  2. Dataset
  3. Results
  4. Steps taken in this Project
  5. Installation and Usage

Description

A machine learning project that detects fraudulent transactions in an imbalanced credit card dataset. Only 0.17% of the transactions in the dataset are fraudulent, which makes it hard to detect these outliers accurately. I visualized certain aspects of the dataset to look for useful insights. Because training the models takes a long time, I used only 20% of the data to train and test them.
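With so few fraud cases, drawing a 20% subset at random can shift the fraud ratio or lose most of the rare class. The following is a minimal sketch of a stratified subsample that keeps the class ratio roughly intact. It is an illustration only: the README does not say how the 20% subset was drawn, and the column names `Class` and `Amount`, the helper `stratified_sample`, and the synthetic rows are all assumptions.

```python
import random
from collections import defaultdict


def stratified_sample(rows, label_key, fraction, seed=0):
    """Return roughly `fraction` of `rows`, sampled separately within each class."""
    rng = random.Random(seed)

    # Group the rows by their class label.
    by_class = defaultdict(list)
    for row in rows:
        by_class[row[label_key]].append(row)

    sample = []
    for members in by_class.values():
        # Keep at least one row per class so the rare class never vanishes.
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))

    rng.shuffle(sample)
    return sample


# Synthetic stand-in for the real data: 10,000 rows, 17 of them fraud (0.17%).
# "Class" and "Amount" are assumed column names, not taken from the project.
rows = [{"Amount": float(i), "Class": 1 if i < 17 else 0} for i in range(10_000)]

subset = stratified_sample(rows, label_key="Class", fraction=0.2)
fraud_count = sum(r["Class"] for r in subset)
print(len(subset), fraud_count, f"{fraud_count / len(subset):.2%}")
```

On this synthetic data the subset holds 2,000 rows with 3 fraud cases, so the fraud share stays close to the original 0.17%.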

Given below is an easy explanation of credit card fraud detection:

Dataset

The link to the dataset is in the Jupyter notebook.

Results

I used five classification algorithms. The goal was not high overall accuracy but high accuracy on the outlier (fraud) class, because the aim is to stop fraudulent transactions, not valid ones. At first it was difficult to pick a model that achieved this. An ensemble model, Random Forest, reached 81% accuracy on the outlier class. I also tried Isolation Forest, an unsupervised algorithm designed specifically for anomaly detection, and it performed well on the 20% subset of the dataset. Either model could be used for prediction; I chose the random forest and saved it using pickle.
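To see why overall accuracy is a poor yardstick here, the sketch below compares it with accuracy restricted to the fraud class. It assumes that "accuracy on the outlier class" means the fraction of true fraud cases that the model flags (this is the same quantity as recall for that class). The function names and the synthetic labels are illustrative and are not taken from the notebook.

```python
def overall_accuracy(y_true, y_pred):
    """Fraction of all samples that were predicted correctly."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


def class_accuracy(y_true, y_pred, label):
    """Fraction of samples whose true class is `label` that were predicted as `label`."""
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    total = sum(1 for t in y_true if t == label)
    return hits / total if total else float("nan")


# Synthetic labels: 1,000 transactions, 2 of them fraudulent (1 = fraud, 0 = valid).
y_true = [1, 1] + [0] * 998

# A useless "model" that labels every transaction as valid.
always_valid = [0] * 1000

print(overall_accuracy(y_true, always_valid))    # 0.998
print(class_accuracy(y_true, always_valid, 1))   # 0.0
```

A model that never flags fraud scores 99.8% overall yet catches no fraud at all, which is why the per-class figure is the one worth tracking.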

Steps taken in this Project

  • Data collection
  • EDA & visualization
  • Model selection & building
  • Evaluation of the models
  • Saving the models
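The last step, saving a model with pickle, can be sketched as a save/load round trip. This is a minimal illustration under stated assumptions: the helpers `save_model` and `load_model` and the file name `model.pkl` are hypothetical, and a plain dictionary stands in for the fitted random forest so that the example runs without extra packages.

```python
import pickle
import tempfile
from pathlib import Path


def save_model(model, path):
    """Serialize `model` to `path` using pickle."""
    with open(path, "wb") as f:
        pickle.dump(model, f)


def load_model(path):
    """Load a pickled model. Only unpickle files you trust:
    pickle can execute arbitrary code while loading."""
    with open(path, "rb") as f:
        return pickle.load(f)


# Stand-in object; in the project this would be the fitted classifier.
stand_in_model = {"name": "random_forest", "n_estimators": 100}

with tempfile.TemporaryDirectory() as tmp:
    model_path = Path(tmp) / "model.pkl"  # hypothetical file name
    save_model(stand_in_model, model_path)
    restored = load_model(model_path)

print(restored == stand_in_model)  # True
```

A pickled scikit-learn model should generally be loaded with the same library version that saved it, so pinning versions in requirements.txt helps keep the saved file usable.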

Installation and Usage

  • Requirements:
    • python >= 3.4
  • git clone https://github.com/AI-14/credit-card-fraud-detection.git - clones the repository
  • cd credit-card-fraud-detection
  • py -m venv yourVenvName - creates a virtual environment (py is the Windows Python launcher)
  • yourVenvName\Scripts\activate.bat - activates the virtual environment (Windows Command Prompt)
  • pip install -r requirements.txt - installs the required packages
