Reading List

Papers, websites, and cool people. Things I've read or am meaning to read. All entries before June 26, 2024 are undated.

Pinned

Papers With Code

All

2025-03-30 Tracing the thoughts of a large language model

2025-03-30 On the Biology of a Large Language Model

2025-03-17 Introduction to Quantum Information Science Lecture Notes

2025-03-09 sesame

2025-03-09 Understanding Transformers... (beyond the Math)

2025-02-23 The Ultra-Scale Playbook: Training LLMs on GPU Clusters

2025-02-18 Useless Use of Cat Award

2025-02-16 "A calculator app? Anyone could make that."

2025-02-10 Training language models to follow instructions with human feedback

2025-02-09 Three Observations

2025-02-07 A Little Bit of Reinforcement Learning from Human Feedback

2025-02-07 sacred.computer

2025-01-29 Quotes

2025-01-29 How has DeepSeek improved the Transformer architecture?

2025-01-28 Essential template metaprogramming

2025-01-27 Evolving Deeper LLM Thinking

2025-01-25 How I became a machine learning practitioner

2025-01-23 PythonRobotics

2025-01-22 Announcing The Stargate Project

2025-01-21 Interfaces

2025-01-21 Predicting Human Brain States with Transformer

2025-01-20 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

2025-01-14 a reinforcement learning guide

2025-01-13 Agents

2025-01-11 200Bn Weights of Responsibility

2025-01-11 Godly

2025-01-11 Vim Kōans

2025-01-10 Static search trees: 40x faster than binary search

2025-01-08 omi: thought to action

2025-01-07 Fauna Robotics

2025-01-07 Perhaps Not The Answer You Were Expecting But You Asked For It

2024-12-28 The 2025 AI Engineering Reading List

2024-12-19 GPU Glossary

2024-12-19 MoonBit Compiler

2024-12-17 Robotic Manipulation

2024-12-17 Underactuated Robotics

2024-12-14 Teaching GHC how to play Minesweeper

2024-12-06 Introducing OpenAI o1

2024-12-04 John Galt (Pseudonymous Designer)

2024-12-02 Simple but Powerful Pratt Parsing

2024-12-02 On the Glucose SAT solver

2024-12-01 Glucose SAT Solver

2024-12-01 Thirty Observations at Thirty

2024-11-29 The UX of LEGO Interface Panels

2024-11-29 Rowan Zellers (Research @ OpenAI)

2024-11-28 Dear future undergraduate researcher

2024-11-28 Competitive Programmer's Handbook

2024-11-23 Untangling Lifetimes: The Arena Allocator

2024-11-23 Rachit Nigam (EECS Professor @ MIT)

2024-11-19 Guillermo Angeris (Chief Scientist @ Bain Capital Crypto)

2024-11-19 Floating Point Instructions

2024-11-19 Lujun's Resources

2024-11-18 SciML

2024-11-16 OpenAI Email Archives (from Musk v. Altman)

2024-11-16 Implementing the Simple C Compiler

2024-11-14 Expository papers

2024-11-13 Personality Basins

2024-11-12 reflections on palantir

2024-11-12 Jason Liu (Instructor Guy)

2024-11-12 Jack Morris (Cornell Tech, FAIR, Google Brain)

2024-11-12 Deep Learning, NLP, and Representations

2024-11-05 Setting Your Pet Rock Free.

2024-08-14 Algorithms for Decision Making

2024-08-12 Evaluating ∇f(x) is as fast as f(x)

2024-08-12 Reinforcement Learning from Human Feedback Book

2024-08-12 Mathematical Foundations of Reinforcement Learning

2024-08-11 Animated AI

2024-08-07 Learning to Move with Affordance Maps

2024-08-07 Imitation Learning

2024-08-06 Algorithms for Modern Hardware

2024-08-05 bytecode interpreters for tiny computers

2024-08-03 Latency Numbers Every Engineer Should Know

2024-07-28 The Matrix Calculus You Need For Deep Learning

2024-07-28 A Recipe for Training Neural Networks

2024-07-28 Practical Deep Learning

2024-07-25 A (Long) Peer into Reinforcement Learning

2024-07-23 Open Source AI Is the Path Forward

2024-07-23 The Llama 3 Herd of Models

2024-07-21 Mamba: The Hard Way

2024-07-18 CleanRL (Clean Implementation of RL Algorithms)

2024-07-18 Policy Gradient Demystified

2024-07-17 Prover-Verifier Games improve legibility of language model outputs

2024-07-16 Codestral Mamba

2024-07-14 An extended collection of matrix derivative results for forward and reverse mode algorithmic differentiation

2024-07-12 Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers

2024-07-12 Modularized Implementation of Deep RL Algorithms in PyTorch

2024-07-10 ML Code Challenges

2024-07-10 lucidrains

2024-07-05 From Autoencoder to Beta-VAE

2024-07-05 Tutorial on Variational Autoencoders

2024-07-01 The Super Effectiveness of Pokémon Embeddings Using Only Raw JSON and Images

2024-07-01 Popular Model-free Reinforcement Learning Algorithms

2024-06-29 Auto-Encoding Variational Bayes

2024-06-29 Stanford CS234: Reinforcement Learning Spring 2024

2024-06-29 Policy Gradient Algorithms

2024-06-28 How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog

2024-06-27 Meta Large Language Model Compiler: Foundation Models of Compiler Optimization

2024-06-27 KAN: Kolmogorov-Arnold Networks

2024-06-26 Higher-order Virtual Machine 2

2024-06-26 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

The Wi-Fi only works when it's raining

Yacine (Software @ X)

Andrej Karpathy (OpenAI, Tesla)

Lilian Weng (Safety @ OpenAI)

Einops: Clear and Reliable Tensor Manipulations with Einstein-like Notation

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Formal Algorithms for Transformers

A high-bias, low-variance introduction to Machine Learning for physicists

Attention Is All You Need

Generative Adversarial Nets

Gradient-Based Learning Applied to Document Recognition

Diffusion Models

UGeneva 14x050 Deep Learning Course

The Transformer Family Version 2.0

Algebra, Topology, Differential Calculus, and Optimization Theory For Computer Science and Machine Learning

UToronto CSC321 Lecture 10: Automatic Differentiation

ML Interviews Book

The Matrix Calculus You Need For Deep Learning

Karpathy's MinBPE

index.globe.engineer

Ishan's Idea List

The Annotated Transformer

The Little Book of Deep Learning

Transformers from Scratch