Skip to content
View JoseJuan98's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report JoseJuan98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JoseJuan98/README.md

Hi, I'm Jose 🧑

StackOverflow

Experienced in the fields of Artificial Intelligence, Data Science, Software Engineering, and Big Data. I specialize as Machine Learning Engineer and provide consulting services for AI & Data related projects.

Highly motivated with a passion for exploring the intersections of Mathematics, Data, and Computer Science, and how they can be used to solve real-world problems.

The repositories you'll see here are mostly for my learning journey towards growing in my career in data science and machine learning engineering or simply fulfilling my personal quest for knowledge.

Featured projects 🔭

  • ml-regression-project-california-housing: showcase my ability to produce a successful and resilient Machine Learning system from the EDA to the engineering phase. It also shows my consultant skills in all the cycles of a project of this kind.
  • scratch-nn: python library for building simple Deep Neural Networks from scratch, using only vectorized operations with NumPy to develop further understanding in the base concepts of neural networks and provide a foundation for building a Production-ready Python library.
  • [ml-classification-titanic]: show my skills as Data Scientist and ML Engineer. (still to be released, working on it currently)
  • statistical-rethinking: repository to practice statistics with Python libraries learned in the book of the same name. (Abandoned atm, I feel my statistical skills are proficient for Data Science and ML, but maybe one day I will come to the basics again to learn a statistical framework)
  • block-chain-java: simulate the functionality of blockchain technology used by crypto-currencies.

Where I Currently Work 💼

Capgemini Engineering: Machine Learning Engineer, AI & Data Consultant

Worked on projects related to:

  • Exploratory Data Analysis, Data Mining, and design and built predictive services for different sectors like Marketing, Life Sciences, Manufacturing, Finances ...
  • Developed Machine Learning Operations (MLOps) and pipelines, based on Python, and AWS Sagemaker, with data versioning using Feature Stores, models versioning in Model Registries, and history tracking of experiments and deployments.
  • Mix Marketing Modeling with Bayesian MCMC using PyMC for a pharmaceutical company.
  • Analyzed data from large-scale databases and developed Machine Learning models in Python with Scikit-Learn and TensorFlow and resilient predictive services to serve them.
  • Facilitated the serving and CI/CD workflows for an operations optimization engine based in Pyomo and built a microservice with Docker to deploy it in Kubernetes.
  • Increase team productivity and code quality by creating a guide consolidating good development, documenting, and testing practices, applying DevOps principles, and developing a Python library to abstract Machine Learning workflows.
  • Designed the architecture and operations of microservices for a Federated Machine Learning platform with flwr, Tensorflow Federated, Django, Mlflow, and Apache Airflow.
  • Contributed to AWS SageMaker SDK to facilitate the use of Feature Stores.
  • Conducted a talk about MLOps with AWS at a conference

What Currently I am studying 📚

I am a master's student in the Applied Artificial Intelligence online program at the Luleå University of Technology As this takes most of my time, in addition to working the learning paths below got stopped.

Topics to learn 🌱

  • Mix marketing modeling (learning currently). Work related, so the project cannot be shown.
  • Bayesian Statistics (stopped), primarily through the book Statistical Rethinking, and some articles that you can find in towards datascience. At the moment this is my progress

  • Time Serie Analysis (finished), I am mainly using any source that I find interesting from this repository which contains a lot of references to learn. Unluckily, any project related to TSA cannot be shown as there are used in my job :(.

  • Neural Networks and Deep Learning (finished), by the Deep Learning Specialization of Coursera, and I have been enjoying being exposed to Linear Algebra and Deep Learning. My progress (First course, finished)

  • Data Engineering with Apache Spark and Delta Lakes (finished). Acquiring a Data Engineer Professional Certificate from Databricks. I enjoy data modeling and creating data pipelines to support Data Science projects.

  • MLOps, DevOps, and AWS SageMaker (finished). Learned and applied in all of the projects at work.
  • Cloud (Azure, AWS) and CI/CD tools (Azure DevOps, GitHub Actions, AWS CodeBuild, CodeDeploy, and CodePipeline) (finished). Learned and applied in all of the projects at work.

👯 I’m looking to collaborate on

Open source projects related to Environmental Intelligence to help to preserve nature or evaluate the state of it.

Certifications already earned

See more at my Linkedin profile.

Pinned Loading

  1. scratch-nn scratch-nn Public

    NNs from scratch

    Jupyter Notebook 2 1

  2. data-pipeline-enterprise-architecture data-pipeline-enterprise-architecture Public

    Project developed for monitoring the enterprise environment of Bergen Kommune. Data pipelines, data modelling and management of data.

    Python

  3. taxis-fare-mlib-pyspark taxis-fare-mlib-pyspark Public

    The software technology which will be discussed in this report is Apache Spark, an unified tool for analysing and processing data. In addition, PySpark is going to be used to write the code in iPyt…

    TeX 1

  4. BHike_AngularJS BHike_AngularJS Public

    Web page implemented with the framework AngularJS

    Java 1

  5. josejuanWSB/printer_format_service josejuanWSB/printer_format_service Public

    CI/CD Pipeline with Flask, Pytest, Travis, and Docker for students to learn the principles of DevOps at WSB University.

    Python