Skip to content
View abideenml's full-sized avatar
Block or Report

Block or report abideenml

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
abideenml/README.md

ML engineer adept at LLM pretraining, fine-tuning, rlhf, rag, and agentic workflows.

πŸ”¬ Recent OS Projects

  • llm.pth - Hackable implementations of Autoregressive models (Llama, mixtral, gemma, deepseek), Research papers (cope, yarn, mod, mome, mla) and techniques (sft, dpo, kto, ipo) in Pytorch.
  • LightAgents - A wrapper free Agents library with RAG, function calling, json mode, telemetry and multi-layer memory.
  • llama3.cuda - llama3.cuda is an implementation of Llama 3.1 in pure C/CUDA. Consists of Swiglu, RoPE, CSE, RMSNorm and GQA kernels.

πŸ’» Recent Work Projects

  • Elemental Compute - Implemented a self-optimizing multimodal pipeline with RAG, Agentic workflow, and open-source AI using LLM-as-a-Judge and Mixture of Agents. Managed 30+ GPUs for multi-node inference of the entire multimodal pipeline consisting of LLama-3.1 70B, Phi-3-medium-128k-instruct, Llava-next-8b, and SDXL-Lightning.
  • John Snow Labs - Released a series of JSL-MedX 3B, 7B, 8B, and 70B LLMs in the Healthcare domain. JSL-MedX models are ranked No. 1 on the Open Medical Leaderboard across all Param variants.
  • QueryLoopAi - Pre-trained a 500M SLM from scratch on a carefully curated high-quality 15B tokens synthetic dataset. Created the entire training and evaluation pipeline along with managing training on 8xA100s. Created Kendrick, a mixture of experts model with 32k experts and Multi-latent head attention.

πŸ“ Recent Writing

View the archives (42 posts) @ zain.com.


Linkedin FollowMedium FollowDiscordTwitterSubstack

Pinned Loading

  1. llm.pth llm.pth Public

    Implementation of various Autoregressive models, Research papers and techniques. Main aim is to write clean, modular, wrapper-free implementations.

    Python

  2. LightAgents LightAgents Public

    A wrapper free Agents library with RAG, function calling, json mode, telemetry and multi-layer memory.

    Python

  3. llama3.cuda llama3.cuda Public

    llama3.cuda is an implementation of Llama 3.1 in pure C/CUDA.

    Cuda

  4. RealTime-StarRatingPrediction-with-AWSKinesis RealTime-StarRatingPrediction-with-AWSKinesis Public

    This repository contains an End to End Real time πŸ•°οΈ Machine Learning Pipeline to predict star ⭐️ rating of product reviews. This project uses AWS Sagemaker, Kinesis, Lambda, S3, Redshift, Athena, a…

    Jupyter Notebook

  5. End2End-Recommendation-Engine_ End2End-Recommendation-Engine_ Public

    πŸš€ End to End Movie Recommendation system built with NextJS, Flask, MySQL, TailwindCSS and AWS πŸ›Έ. Performs πŸ“ Content based, Collaborative and Neural Collaborative recommendations. --> Ongoing project

    TypeScript

  6. Kedro-MLops-pipeline Kedro-MLops-pipeline Public

    Churn Prediction with Kedro, Kedro-Viz, and Kedro-Mlflow ❄️ πŸ‘¨. PowerBI Dashboard πŸ“Š also included. KedroπŸ”— https://kedro.org/

    Jupyter Notebook 3 1