This repo shows how to deploy and manage machine learning models in production.
Steps covered:
- Define our problem and perform EDA
- Develop an ETL pipeline
- Train a model
- Deploy the model to cloud
- Develop and deploy a retraining pipeline
- Monitor the model performance
The focus is on the tooling and ML best practices: in particular, dockerizing the two key pipelines (retraining and inference) and deploying them to AWS. The problem itself, predicting YouTube views from just the channel name and video category, is rather trivial and would usually be more complex in the real world. However, the methods for managing the ML lifecycle are very relevant and can be used to deploy real-world projects.
Inference endpoint available at: mlprojectsbyjen.com
The inference pipeline consists of two components: a web endpoint and a prediction API. The web endpoint is responsible for the user interface; the prediction API is responsible for accepting requests from the web endpoint and responding with the predictions made by the ML model. The components are separated using Elastic Load Balancers (ELB). Each component is wrapped in a Docker container, deployed using Elastic Container Service (ECS), and placed in an Auto Scaling Group (ASG), allowing for quick scalability. All the services are spread across 3 Availability Zones (AZ), ensuring high availability.
The architecture follows a simple 2-tier design. Traffic flows from users to the external Application Load Balancer (ALB), which distributes it across Elastic Container Service (ECS) Tasks. When the user presses Predict on the web app, a request is sent to the internal ALB. The App-tier Tasks compute the ML prediction and return it to the Web tier, where the results are displayed back to the user.
* Why is the App tier public? Because NAT Gateways are expensive for a small project such as this one: around $40 per month per AZ. There are no security concerns, so making the App tier public seems most reasonable.
** In reality there are 3 AZs configured
*** Depending on when you are reading this, the endpoint mlprojectsbyjen.com might actually use a monolith deployment instead of a 2-tier architecture. It doesn't scale as well, but it allows fewer Tasks to be running, which cuts costs.
The app itself uses standard Python ML libraries: Pandas, scikit-learn, XGBoost, FastAPI, and Streamlit. Neptune AI is used for experiment tracking and as a model registry.
AWS Service choices:
Compute
- ECS for ease of deployment

Storage
- S3 for scalability and AWS integrations

Feature Store
- DynamoDB for quick read access

Scaling and High Availability
- ALB and ASG, as they are the recommended standard in AWS

Access and Security
- IAM Roles for AWS access and SSM Parameter Store for distributing keys for external services such as Neptune AI
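Fetching an external-service key (such as the Neptune AI token) from SSM Parameter Store might look like the sketch below. The parameter name is a hypothetical example, and the client is passed in so the helper can be tested without AWS access.

```python
# Hedged sketch of reading a secret from SSM Parameter Store.
# The parameter name "/ml-app/neptune-api-token" is a hypothetical
# example; the ECS Task's IAM Role grants ssm:GetParameter, so no
# keys are baked into the container image.
def get_secret(name: str, ssm_client) -> str:
    # WithDecryption=True transparently decrypts SecureString parameters.
    resp = ssm_client.get_parameter(Name=name, WithDecryption=True)
    return resp["Parameter"]["Value"]

# In the container this would be called as, roughly:
#   import boto3
#   token = get_secret("/ml-app/neptune-api-token", boto3.client("ssm"))
```

Injecting the client rather than constructing it inside the function keeps the secret-fetching logic unit-testable with a stub.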