Skip to content
View fabio-cancio-sena's full-sized avatar

Block or report fabio-cancio-sena

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
fabio-cancio-sena/README.md

Fábio Sena

Email: fabio.cancio.sena@gmail.com | Phone: +55 61 992015000 LinkedIn: https://linkedin.com/in/fabio-sena-mlengineer

Summary

Hey there! I'm Fábio Sena, a Senior Machine Learning Engineer at NuBank, passionate about tackling intriguing machine learning problems. With seven years of experience, I've explored the lesstraveled paths, collaborating with tech industry leaders to deliver innovative and game-changing solutions.

My expertise lies in weaving together machine learning, cloud computing, algorithms, and data structures to build highly efficient and scalable models. At Appen, I successfully deployed nearly 30 models in production, while also contributing to the design and migration of ML cloud architecture, enhancing scalability and efficiency. Additionally, I possess expertise in computer vision and natural language processing.

During my time at behup, I designed a robust cloud data architecture utilizing Spark and Elasticsearch. This enabled efficient management, processing, and searchability of millions of data points, providing valuable data-driven insights. Leveraging classical machine learning and NLP techniques, I extracted valuable information from textual data, facilitating decision-making.

My background in software architecture enables me to envision the bigger picture and build robust, scalable systems. By fitting ML solutions seamlessly into broader tech frameworks, I ensure smooth and efficient outcomes.

Apart from my practical experience, I hold an MBA in Data Science and Big Data, enhancing my planning, cloud computing, communication, and advanced data analytics skills. My capstone project on CV Object Detection showcased cutting-edge computer vision techniques.

As an advocate of lifelong learning, I stay updated with the latest advancements in the field. Fluent in English, Portuguese, and Spanish, I thrive in collaborating with multicultural teams to deliver impressive results.

Creating scalable and reliable machine learning models that make a real difference fuels my passion. If you have a project or an idea you'd like to discuss, I'm eager to listen. Let's connect and leave our mark in the world of machine learning together!

Skills

  • Programming Languages: Python, SQL
  • Cloud Computing and Big Data: Amazon Web Services (AWS), AWS SageMaker, Cloud Computing, Data Engineering, Apache Spark, Hadoop, Distributed Computing
  • **Machine Learning Libraries and Techniques: PyTorch, TensorFlow, Scikit-Learn, Natural Language Processing, Reinforcement Learning, Unsupervised Learning, Convolutional Neural Networks (CNN), Generative Adversarial Networks (GANs), Predictive Modeling
  • Deep Learning and Natural Language Processing Models: BERT, GPT-3
  • MLOps and CI/CD: Docker, Kubeflow, Machine Learning Pipeline, Continuous Integration and Continuous Delivery (CI/CD), Airflow, Kubernetes, MLOps
  • Advanced and Emerging Fields: Differential Privacy, Federated Learning, LlamaIndex, LangChain
  • Soft Skills: Effective Communication, Problem Solving, Critical Thinking, Project Management, Teamwork/Collaboration, Adaptability, Emotional Intelligence

Experience

Machine Learning Engineer, Novatics, Aug 2020 - Present

  • Led development of ML solutions for Appen, a data labeling company, including image deduplication, batch inference, and trainable models using transfer learning
  • Designed and built stream-based real-time ML architecture using Apache Kafka
  • Developed cutting-edge computer vision models for face detection and blurring, license plate detection and blurring, text transcription, car object detection from LiDAR data, and label bounding boxes in street scene images
  • Designed and implemented models for audio/speech processing and speaker diarization
  • Built a dashboard to report ML model usage by projects, clients, and periodicity using Mode Analytics

Machine Learning Engineer, behup, Apr 2016 - Jul 2020

  • Created a suite of 6 products to understand consumer buying behavior and product pricing strategy using alternative choice modeling techniques and large data samples
  • Evaluated video ad's effectiveness using data science and neurosciences
  • Built a product to automatically understand consumer opinions from videos and audio using speech-to-text and a combination of NLP techniques

Machine Learning Engineer, Luiza AI, Nov 2017 - Jul 2018

  • Designed and implemented an end-to-end ML pipeline for Luiza.ai, including credit default risk prediction and customer recommender system using cluster analysis

Projects

  • Implemented a scalable and trainable message-based ML architecture using AWS SageMaker, Kafka, AWS EventBridge, SNS, SQS, and Batch Transformer
  • Developed a natural language processing pipeline to extract valuable information from legal case files in PDF format, grouping related files into hierarchical clusters
  • Built a customer-centric recommender system using cluster analysis and the R library RecommenderLab
  • Built an end-to-end pipeline using XGBoost to predict credit default risk
  • Used advanced business analytics to create predictive models

Education

  • MBA in Data Science and Big Data - IGTI - Institute of Management in Information Technology - 2016 - 2017
    • Focused on management, big data, statistics, data science, and machine learning.
    • Relevant coursework includes infrastructure and cloud computing, big data fundamentals, data modeling with Hadoop, statistical analysis, and machine learning applied to big data.
  • Bachelor's degree, Bachelor's Degree in Information Systems - UniEuro - Euroamerican University - 2000 - 2004

Pinned Loading

  1. shiny-application-and-reproducible-pitch shiny-application-and-reproducible-pitch Public

    Reproducible Pitch Presentation - Explore the relation between two mtcars variables

    R

  2. CenterTrack CenterTrack Public

    Forked from xingyizhou/CenterTrack

    Simultaneous object detection and tracking using center points.

    Python

  3. DCNv2 DCNv2 Public

    Forked from CharlesShang/DCNv2

    Deformable Convolutional Networks v2 with Pytorch

    C++

  4. langchain-tutorials langchain-tutorials Public

    Forked from gkamradt/langchain-tutorials

    Overview and tutorial of the LangChain Library

    Jupyter Notebook