Skip to content
View longNguyen010203's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Organizations

@NTL-DE
Block or Report

Block or report longNguyen010203

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
longNguyen010203/README.md

Hey, I'm Long Nguyen πŸ‘‹

I'm an Artificial Intelligence major in Vietnam, passionate about data engineering. I'm actively honing my skills, expanding my knowledge, and seeking work opportunities in this field, with a current focus on deepening my understanding of cloud technologies.

πŸ“¦ Technologies

Languages: Python SQL PySpark Shell C++

Architecture: ETL ELT Lambda Kappa Star Schema Snowflake Schema

Processing: Spark Kafka Airflow Dagster Dbt Pandas Polars Airbyte Selenium BeautifulSoup

Storage: Snowflake RDS DynamoDB Redshift S3 SQL Server PostgreSQL MySQL MinIO SQLite

AWS: CloudFormation S3 EC2 IAM VPC Redshift EMR Glue RDS Lambda DynamoDB Kinesis

DevOps: Docker Zookeeper Terraform GitHub Actions Git GitLab

πŸ“’ Certificates

Β Β Β Β Β Β  Β Β Β Β Β Β Β  Β Β Β Β 
Β Β Β  Β Β Β Β Β Β Β Β  Β Β Β Β Β Β  Β Β Β Β Β Β 

πŸ“« Contact

Connect with me, LinkedIn

Pinned Loading

  1. LONGNGUYEN--AWS-Trainning--2024 LONGNGUYEN--AWS-Trainning--2024 Public

    ☁️🌈πŸ”₯ Welcome to my AWS Cloud Training repository! This repo contains notes, exercises, and projects from my AWS Cloud training journey, showcasing my progress and understanding of AWS services. πŸ’¨

    JavaScript 1

  2. Youtube-ETL-Pipeline Youtube-ETL-Pipeline Public

    πŸ’œπŸŒˆπŸ“Š A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Docker. Data from kaggle and youtube-api 🌺

    Jupyter Notebook 9 1

  3. Spark-Kafka-Self-Learning Spark-Kafka-Self-Learning Public

    πŸ“šπŸŒŠπŸŽ“ A third-year student is self-studying Spark and Kafka as part of their πŸ‘· data engineering journey, with the goal of securing an πŸ“¬ internship or fresher job in 2024.

    Shell 1

  4. Spark-Processing-AWS Spark-Processing-AWS Public

    πŸ‘·πŸŒ‡ Set up and build a big data processing pipeline with Apache Spark, πŸ“¦ AWS services (S3, EMR, EC2, IAM, VPC, Redshift) Terraform to setup the infrastructure and Integration Airflow to automate wor…

    Python 1

  5. 100Day-Self-Learning-DE 100Day-Self-Learning-DE Public

    πŸ“šπŸ’»βŒ¨ Self-study process for more than 3 months with 3-4h/day to prepare for the journey of applying for an intern or fresher position as a Data Engineer in 2024 ️πŸ₯‡οΈπŸ†

    Jupyter Notebook 1

  6. ECommerce-ELT-Pipeline ECommerce-ELT-Pipeline Public

    πŸŒ„πŸ“ˆπŸ“‰ A Data Engineering Project 🌈 that implements an ELT data pipeline using Dagster, Docker, Dbt, Polars, Snowflake, PostgreSQL. Data from kaggle website πŸ”₯

    Python 1