Celestial Body Size Predictor

By: Nate DiRenzo

Statement of Need:

Potentially Hazardous Objects (PHOs) are near-Earth objects whose orbits can bring them into close proximity to the planet and that are large enough to cause significant damage in the event of an impact.

Asteroids larger than 35 meters in diameter can pose a threat to a city or town. However, the diameter of most small celestial objects is not well determined, because it is usually estimated from brightness and distance rather than measured directly by radar.
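
To illustrate why brightness-based sizes are uncertain, here is a minimal sketch using the standard relation between absolute magnitude H, geometric albedo p_V, and diameter: D = 1329 km / sqrt(p_V) * 10^(-H/5). The function name and the albedo values tried below are assumptions chosen for illustration; they are not taken from this project's code.

```python
import math


def estimate_diameter_km(abs_magnitude: float, albedo: float) -> float:
    """Estimate diameter in km from absolute magnitude H and geometric albedo."""
    if albedo <= 0:
        raise ValueError("albedo must be positive")
    return 1329.0 / math.sqrt(albedo) * 10 ** (-abs_magnitude / 5.0)


# Illustrative only: the same brightness (H = 22) maps to different sizes
# depending on which albedo is assumed for the object.
for assumed_albedo in (0.05, 0.14, 0.25):
    diameter_m = estimate_diameter_km(22.0, assumed_albedo) * 1000
    print(f"albedo={assumed_albedo:.2f} -> about {diameter_m:.0f} m")
```

Because albedo is rarely known for small objects, the estimate can shift by a factor of two or more for the same observed brightness, which is the gap a learned model would try to narrow.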

Because the true size of most celestial objects is not well determined, we will strive to produce a model that can accurately estimate the diameter of objects in space, given a set of easily observable features.

Goal:

The goal of this project is to productionize a model that predicts the diameter of celestial objects with some degree of accuracy. To do so, we will store a database of 800,000 measurements of celestial objects in Google Cloud Storage, build a model in Python with PySpark, and build a front-end web application with Streamlit. As a further goal, I would like to containerize the script and web application using Docker.

Success Metrics:

The primary metric for success is whether the model functions in production. A secondary metric is the efficacy of the model at predicting the size of celestial objects.
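
The proposal does not say how predictive efficacy will be scored. As one possible approach, the sketch below computes root-mean-squared error (RMSE) and the coefficient of determination (R²) for a regression model. The function names and the toy numbers are assumptions made up for illustration, not project results.

```python
import math


def rmse(actual, predicted):
    """Root-mean-squared error between two equal-length sequences."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_squared(actual, predicted):
    """Coefficient of determination: 1 minus (residual SS / total SS)."""
    mean_actual = sum(actual) / len(actual)
    ss_residual = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_total = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_residual / ss_total


# Toy diameters in km, invented purely to exercise the functions.
toy_actual = [1.0, 2.0, 3.0, 4.0]
toy_predicted = [1.1, 1.9, 3.2, 3.8]
print("RMSE:", rmse(toy_actual, toy_predicted))
print("R^2:", r_squared(toy_actual, toy_predicted))
```

RMSE is in the same units as the diameter, which makes it easy to interpret, while R² gives a unit-free comparison against always predicting the mean.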

Data Description:

The data is taken from the Jet Propulsion Laboratory at the California Institute of Technology. The full Small-Body Database contains 1.2 million entries with measurements of objects in our solar system. Roughly 150,000 entries contain diameter data.

Tools:

Google Cloud Storage for Data Warehousing
Google Colab for Cloud-based Scripting
PySpark for Modelling
Streamlit for Frontend Application
Docker for Containerization

Models:

Gradient Boosted Tree Regression with PySpark
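
For readers unfamiliar with the technique, the following is a minimal pure-Python sketch of gradient boosting for squared-error regression, using one-split trees (stumps) on a single feature. It is not the PySpark implementation and not this project's code: the function names are hypothetical, and the training data is synthetic, generated from the standard magnitude-to-diameter relation with an assumed albedo of 0.14.

```python
import math
from statistics import mean


def fit_stump(xs, residuals):
    """Find the threshold split on x that minimizes squared error on the residuals."""
    candidates = sorted(set(xs))[:-1]
    if not candidates:
        raise ValueError("need at least two distinct feature values")
    best = None
    for threshold in candidates:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_value, right_value = mean(left), mean(right)
        error = sum((r - left_value) ** 2 for r in left)
        error += sum((r - right_value) ** 2 for r in right)
        if best is None or error < best[0]:
            best = (error, threshold, left_value, right_value)
    return best[1:]  # (threshold, left_value, right_value)


def fit_gradient_boosting(xs, ys, n_rounds=50, learning_rate=0.1):
    """Start from the mean, then repeatedly fit a stump to the current residuals."""
    base = mean(ys)
    predictions = [base] * len(ys)
    stumps = []
    for _ in range(n_rounds):
        residuals = [y - p for y, p in zip(ys, predictions)]
        threshold, left_value, right_value = fit_stump(xs, residuals)
        stumps.append((threshold, left_value, right_value))
        predictions = [
            p + learning_rate * (left_value if x <= threshold else right_value)
            for x, p in zip(xs, predictions)
        ]
    return base, stumps, learning_rate


def predict(model, x):
    """Sum the base value and the shrunken contribution of every stump."""
    base, stumps, learning_rate = model
    return base + sum(
        learning_rate * (left if x <= threshold else right)
        for threshold, left, right in stumps
    )


def mean_squared_error(model, xs, ys):
    return mean((y - predict(model, x)) ** 2 for x, y in zip(xs, ys))


# Synthetic data (assumption): feature is absolute magnitude H, target is
# log10 of the diameter in km for an assumed albedo of 0.14.
xs = [float(h) for h in range(10, 26)]
ys = [math.log10(1329.0 / math.sqrt(0.14)) - h / 5.0 for h in xs]

baseline = fit_gradient_boosting(xs, ys, n_rounds=0)  # predicts the mean only
model = fit_gradient_boosting(xs, ys, n_rounds=50)
print("baseline MSE:", mean_squared_error(baseline, xs, ys))
print("boosted MSE: ", mean_squared_error(model, xs, ys))
```

Each round fits a small tree to what the ensemble still gets wrong and adds a shrunken copy of it, so the training error falls step by step. A library implementation such as PySpark's gradient boosted tree regressor applies the same idea with deeper trees and many features.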

MVP Goal:

Basic implementation of the model running on a local machine and deployed to Streamlit. From there, I can work on expanding the data and moving the entire pipeline into the cloud.
