IntroToPandas

This repo contains the material covered in the 'Intro to Pandas' talk given as part of the Victoria Data Science Meetup. For more details about the data science community in Victoria, please visit the Victoria Data Science Meetup page.

Requirements

Environment Setup

  • Create an Anaconda environment. Open your terminal and run the following commands. (On conda 4.4 or later, conda activate Intro2Pandas replaces the older source activate form.)
$ conda create -n Intro2Pandas python=3.6
$ source activate Intro2Pandas
(Intro2Pandas) $ conda install pandas
(Intro2Pandas) $ conda install jupyter notebook
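
As a quick sanity check after the install, a minimal sketch like the one below can be run from a Python prompt or a notebook cell inside the activated environment. It only confirms that pandas imports and reports which versions are in use; the versions you see will depend on your own install.

```python
import sys

import pandas as pd

# Report the interpreter and pandas versions found in the active environment.
print("Python:", sys.version.split()[0])
print("pandas:", pd.__version__)
```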

Dataset

For this talk we will be using the dataset from the Kaggle competition 'House Prices: Advanced Regression Techniques'. Kaggle is a great place to learn and practice data science, and it has a rich community ranging from beginners to advanced practitioners. If you haven't created an account yet, please sign up, join the competition and download the data from the following link - https://www.kaggle.com/c/house-prices-advanced-regression-techniques
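
To give a feel for the first step with the data, here is a minimal sketch of loading a CSV into a Pandas DataFrame and taking a first look. It is not taken from the talk's notebook. The inline rows are a made-up stand-in so the snippet runs without the download; with the real data you would pass the path of the downloaded training file (assumed here to be train.csv) to read_csv instead.

```python
import io

import pandas as pd

# Made-up stand-in rows for illustration only. With the real download,
# replace sample_csv with the file path, e.g. "train.csv" (assumed name).
sample_csv = io.StringIO(
    "Id,LotArea,YearBuilt,SalePrice\n"
    "1,8450,2003,208500\n"
    "2,9600,1976,181500\n"
    "3,11250,2001,223500\n"
)

# Use the Id column as the row index.
train = pd.read_csv(sample_csv, index_col="Id")

print(train.shape)                    # (rows, columns)
print(train.head())                   # first few rows
print(train["SalePrice"].describe())  # summary statistics of the target
```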

Tutorial

Once the environment is set up, you can run jupyter notebook and start following the Intro to Pandas.ipynb notebook. The objectives of the talk are as follows:

  • Get familiar with Kaggle
  • Understand how to use the Pandas library
  • Cover the foundations of data science
  • Submit final results to the Kaggle competition

By the end of this talk, you will see yourself on the leaderboard of an ongoing data science competition on Kaggle :-)
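
For the final objective, the following is a rough sketch of how a submission file could be assembled with Pandas. It is not the approach used in the notebook: it swaps in a naive baseline that predicts the median training price for every test row, and it assumes the competition expects Id and SalePrice columns as in its sample submission. The data frames are made-up stand-ins for the downloaded files.

```python
import pandas as pd

# Made-up stand-in data; in practice these would be read from the
# downloaded training and test files.
train = pd.DataFrame({"Id": [1, 2, 3], "SalePrice": [208500, 181500, 223500]})
test = pd.DataFrame({"Id": [1461, 1462]})

# Naive baseline: predict the training median for every test row.
baseline = train["SalePrice"].median()
submission = pd.DataFrame({"Id": test["Id"], "SalePrice": baseline})

# With no path argument, to_csv returns the CSV text. Pass a file name
# such as "submission.csv" (hypothetical) to write a file for upload.
csv_text = submission.to_csv(index=False)
print(csv_text)
```

A baseline like this will not score well, but it exercises the whole path from data to leaderboard before any modelling is done.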

Author

Sri Raghu Malireddi / @r4ghu