Skip to content

This project goes through the process of classifying text messages in spam and not spam messages.

License

Notifications You must be signed in to change notification settings

evertonbin/spam-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Text Messages - Spam Classification

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

All the necessary libraries to run the code were already available in the Anaconda distribution of Python, except:

  1. WordCloud, which can be easily installed with !pip install wordcloud.

This script was written using Python version 3.*.

Project Motivation

Since we all receive spam messages in our email accounts or through our cell phones, I wanted to explore text messages to understand why spam filters are not perfect, even with the advances in technology. Are there significant differences between spam and not spam messages? Is there something we can do to improve the models in the process of idetifying these differences?

File Descriptions

  1. Notebook - Jupyter Notebook with the script developed for classifying spam text messages.
  2. Csv file - a copy of the dataset containing text messages previously classified as spam and not spam.

Results

As a result, different models were created with different approaches in order to improve the performance in classifying spam messages, and some new features were created while analyzing differences between spam and not spam messages.

Licensing, Authors, Acknowledgements

Credits must be given to the University of Michigan for making the data available along in its Applied Text Mining in Python course available in the Coursera platform.

About

This project goes through the process of classifying text messages in spam and not spam messages.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published