Skip to content

This project is based on the Kaggle challenge called 'Titanic: Machine Learning from Disaster.' It aims to use existing data regarding passengers aboard the Titanic, including their age, gender, cabin class, ticket fare, etc. to build a model that can predict the survival of a passenger from the test set.

Notifications You must be signed in to change notification settings

emilbiju/Titanic_survival_challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Titanic Survival Challenge

This project is based on the Kaggle challenge called 'Titanic: Machine Learning from Disaster.' It aims to use existing data regarding passengers aboard the Titanic, including their age, gender, cabin class, ticket fare, etc. to build a model that can predict the survival of a passenger from the test set.

The following steps have been taken to process the data and feed it to various classfication models.

  1. Data Cleaning and substitutions for null values
  2. Encoding categorical variables using the sklearn library
  3. Reducing skewness of numerical variables using log transfrmation
  4. Creating new, meaningful features that capture the relevant information better.
  5. Making trial runs with various classification models and assessing accuracy using 5-fold cross validation

About

This project is based on the Kaggle challenge called 'Titanic: Machine Learning from Disaster.' It aims to use existing data regarding passengers aboard the Titanic, including their age, gender, cabin class, ticket fare, etc. to build a model that can predict the survival of a passenger from the test set.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published