
This project centers around the ETL (Extract, Transform, Load) process and visualization of the NYC Airbnb dataset, obtained from Kaggle. The dataset encompasses a rich collection of information related to Airbnb listings in New York City, covering aspects such as property details, pricing, location, and host characteristics.


dominikvukelic/airbnb_ny_project


NY Airbnb Dataset ETL and Visualization

Overview

This repository focuses on the ETL (Extract, Transform, Load) process and visualization of the NY Airbnb dataset, obtained from Kaggle. The project provides a structured workflow for cleaning the original dataset, performing exploratory data analysis, and creating insightful visualizations.

Repository Structure

  • starting_file/: Contains the original dataset from Kaggle.

    • Airbnb_Open_Data.csv: The raw data file downloaded from Kaggle.
  • cleaning_starting_file/: Contains the Jupyter Notebook for cleaning and transforming the starting file.

    • airbnb_ny_df.ipynb: Jupyter Notebook documenting the process of importing the starting file, cleaning and analyzing it, and exporting the cleaned dataset.

    • exported_cleaned_starting_file/: Contains the cleaned dataset exported from the ETL process.

      • airbnb_df_cleaned.csv: The cleaned and processed dataset, ready for analysis.

  • graphs/: Contains Jupyter Notebook for data analysis and visualizations.

    • graphs.ipynb: Jupyter Notebook documenting the process of cleaning and transforming the starting file, and creating visualizations.
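The extract, clean, and export flow described above can be sketched roughly as follows. This is a minimal illustration, not the code from airbnb_ny_df.ipynb: the function names, the column names (`id`, `NAME`, `price`), and the specific cleaning steps (normalizing headers, dropping duplicate rows, parsing currency strings) are all assumptions made for the example. A small in-memory CSV stands in for Airbnb_Open_Data.csv so the sketch runs on its own.

```python
import io

import pandas as pd


def extract(source):
    """Read the raw CSV (a file path or file-like object) into a DataFrame."""
    return pd.read_csv(source, low_memory=False)


def transform(df):
    """Apply a few generic cleaning steps (assumed, not the notebook's exact ones)."""
    out = df.copy()
    # Normalize headers: "NAME" -> "name", "host id" -> "host_id".
    out.columns = [c.strip().lower().replace(" ", "_") for c in out.columns]
    # Drop rows that are exact duplicates.
    out = out.drop_duplicates()
    # Parse currency strings such as "$1,200 " into numbers (hypothetical column).
    if "price" in out.columns:
        stripped = out["price"].astype(str).str.replace(r"[$,\s]", "", regex=True)
        out["price"] = pd.to_numeric(stripped, errors="coerce")
    return out.reset_index(drop=True)


def load(df, target):
    """Write the cleaned DataFrame to CSV without the index column."""
    df.to_csv(target, index=False)


# Stand-in for starting_file/Airbnb_Open_Data.csv (made-up rows).
raw = io.StringIO(
    'id,NAME,price\n'
    '1,Loft,"$1,200 "\n'
    '1,Loft,"$1,200 "\n'
    '2,Studio,$80\n'
)
cleaned = transform(extract(raw))

exported = io.StringIO()
load(cleaned, exported)
print(exported.getvalue())
```

To run the same steps against the real files, pass the path to Airbnb_Open_Data.csv to `extract` and a destination path to `load` instead of the in-memory buffers.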

Usage

Use the cleaned dataset for your own analysis, or open the Jupyter Notebooks to follow each step of the ETL process and the visualization techniques applied to the NY Airbnb dataset.
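As a starting point for exploring the cleaned data, a grouped summary might look like the sketch below. The path in `CLEANED_CSV` assumes the exported folder sits inside cleaning_starting_file/, and the `room_type` and `price` columns and the `summarize_by` helper are hypothetical names chosen for illustration. Check the actual column names in airbnb_df_cleaned.csv before adapting it.

```python
import pandas as pd

# Assumed location of the cleaned file, relative to the repository root.
CLEANED_CSV = "cleaning_starting_file/exported_cleaned_starting_file/airbnb_df_cleaned.csv"


def summarize_by(df, group_col, value_col):
    """Count listings and take the median of value_col for each group."""
    return (
        df.groupby(group_col)[value_col]
        .agg(listings="count", median="median")
        .sort_values("median", ascending=False)
    )


# Made-up sample rows so the sketch runs without the real file.
# With the repository checked out, use: sample = pd.read_csv(CLEANED_CSV)
sample = pd.DataFrame(
    {
        "room_type": ["Private room", "Entire home/apt", "Entire home/apt", "Private room"],
        "price": [80, 200, 300, 120],
    }
)

summary = summarize_by(sample, "room_type", "price")
print(summary)
```

With matplotlib installed, `summary["median"].plot.bar()` would turn the same table into a simple bar chart.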
