A diamond carat prediction model and data engineering pipeline on AWS.

DavideAG/Diamonds

Diamonds

This repository contains a diamond carat prediction model developed on AWS. The project also includes a data engineering pipeline that captures requests from a Kinesis data stream and forwards them to an API Gateway endpoint connected to the model hosted on SageMaker.

The entire architecture follows the serverless paradigm and is defined with the Terraform modules in this repository.

image-20210711161121292
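
The stream-to-API step described above can be sketched in Python. This is a minimal illustration, not the repository's actual code: it assumes the consumer is a Lambda-style function receiving the standard Kinesis event shape, and the names `PREDICT_URL`, `decode_kinesis_records`, `post_json` and `handler`, as well as the endpoint URL, are hypothetical.

```python
import base64
import json
import urllib.request
from typing import Callable, Dict, List

# Hypothetical endpoint: the real API Gateway URL is not given in this README.
PREDICT_URL = "https://example.execute-api.eu-west-1.amazonaws.com/prod/predict"


def decode_kinesis_records(event: Dict) -> List[Dict]:
    """Decode the base64-encoded JSON payload carried by each Kinesis record."""
    payloads = []
    for record in event.get("Records", []):
        raw = base64.b64decode(record["kinesis"]["data"])
        payloads.append(json.loads(raw))
    return payloads


def post_json(url: str, payload: Dict) -> Dict:
    """POST a JSON body with the standard library and parse the JSON reply."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read())


def handler(event: Dict, context=None,
            post: Callable[[str, Dict], Dict] = post_json) -> List[Dict]:
    """Forward every decoded request to the prediction API and collect the replies."""
    return [post(PREDICT_URL, payload) for payload in decode_kinesis_records(event)]
```

The `post` parameter is injectable so the forwarding logic can be exercised locally with a stub, without calling a deployed endpoint.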

DynamoDB stores the diamond predictions, and an S3 bucket stores the messages coming from the Kinesis data stream.
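
As a rough sketch of how a prediction could be written to DynamoDB, the helper below builds an item and hands it to a table object. The attribute names (`prediction_id`, `created_at`, `features`, `carat`) and the function names are assumptions made for illustration; the actual table schema is defined elsewhere in the repository. The `Decimal` conversion is there because the boto3 DynamoDB resource API rejects Python floats.

```python
import json
import time
import uuid
from decimal import Decimal
from typing import Any, Dict


def build_prediction_item(features: Dict[str, Any], carat: float) -> Dict[str, Any]:
    """Build a DynamoDB-friendly item; every float is converted to Decimal."""
    # Round-trip through JSON so floats, including nested ones, are parsed as Decimal.
    safe_features = json.loads(json.dumps(features), parse_float=Decimal)
    return {
        "prediction_id": str(uuid.uuid4()),  # hypothetical partition key
        "created_at": int(time.time()),      # Unix timestamp in seconds
        "features": safe_features,
        "carat": Decimal(str(carat)),
    }


def store_prediction(table: Any, features: Dict[str, Any], carat: float) -> Dict[str, Any]:
    """Write the item through any object exposing put_item(Item=...)."""
    item = build_prediction_item(features, carat)
    table.put_item(Item=item)
    return item
```

With boto3 installed, `table` could be obtained as `boto3.resource("dynamodb").Table("diamond-predictions")`, where the table name is again hypothetical.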

A front end was developed to interact with the REST API from a web browser. The idea is to deploy it in an S3 bucket, as shown in the following structure:

image-20210711161822539

This project was developed for the exam "Data: Platforms and vendor solutions" of the 2nd-level Specializing Master in AI, ML and Cloud Computing at Politecnico di Torino.
