Skip to content

Latest commit

 

History

History
459 lines (232 loc) · 46.3 KB

LargeLanguageModelsResources.md

File metadata and controls

459 lines (232 loc) · 46.3 KB

Large Language Models Resources

articles

Human-like Reasoning and Representation Learning

Theorem Proving

LLM Tokenization

Context Window representations and implementations

Time-series forecasting and classification tasks

Retrieval-Augmented Generation (RAG)

Retrieval-Augmented Fine Tuning (RAFT)

relevant repos:

The Attention Mechanicsm in Large Language Models

Compiler Optimization using LLM

Evaluation of LLMs

online videos and blogs

Resource on LLM visualization

The resource below attempts to visualize what is happening in LLM under the hood and is a helpful tool to comprehend the work of decoder-only Transformer-based LLMs. The author Brendan Bycroft has made an interesting attempt to visualize these structures and clarify how they operate. This webpage in the link below provides visualization for a family of GPT models, presented in 3D animations with walkthrough. The tool provides a step-by-step guide for single-token inference, coupled with interactive elements for a hands-on experience.

https://bbycroft.net/llm

Articles on LLMs in Cornell University's Advancing AI for Humanity blog

The blog: https://thegenerality.com/agi/

some of the articles:

medium