reading list
papers, websites, and cool people. things i've read or am meaning to read. all entries before june 26, 2024 undated.
pinned
all
2025-01-22 Announcing The Stargate Project
2025-01-21 Interfaces
2025-01-21 Predicting Human Brain States with Transformer
2025-01-20 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
2025-01-14 a reinforcement learning guide
2025-01-13 Agents
2025-01-11 200Bn Weights of Responsibility
2025-01-11 Godly
2025-01-11 Vim Kōans
2025-01-10 Static search trees: 40x faster than binary search
2025-01-08 omi: thought to action
2025-01-07 Fauna Robotics
2025-01-07 Perhaps Not The Answer You Were Expecting But You Asked For It
2024-12-28 The 2025 AI Engineering Reading List
2024-12-19 GPU Glossary
2024-12-19 MoonBit Compiler
2024-12-17 Robotic Manipulation
2024-12-17 Underactuated Robotics
2024-12-14 Teaching GHC how to play Minesweeper
2024-12-06 Introducing OpenAI o1
2024-12-04 John Galt (Pseudonymous Designer)
2024-12-02 Simple but Powerful Pratt Parsing
2024-12-02 On the Glucose SAT solver
2024-12-01 Glucose SAT Solver
2024-12-01 Thirty Observations at Thirty
2024-11-29 The UX of LEGO Interface Panels
2024-11-29 Rowan Zellers (Research @ OpenAI)
2024-11-28 Dear future undergraduate researcher
2024-11-28 Competitive Programmer's Handbook
2024-11-23 Untangling Lifetimes: The Arena Allocator
2024-11-23 Rachit Nigam (EECS Professor @ MIT)
2024-11-19 Guillermo Angeris (Chief Scientist @ Bain Capital Crypto)
2024-11-19 Floating Point Instructions
2024-11-19 Lujun's Resources
2024-11-18 SciML
2024-11-16 OpenAI Email Archives (from Musk v. Altman)
2024-11-16 Implementing the Simple C Compiler
2024-11-14 Expository papers
2024-11-13 Personality Basins
2024-11-12 reflections on palantir
2024-11-12 Jason Liu (Instructor Guy)
2024-11-12 Jack Morris (Cornell Tech, FAIR, Google Brain)
2024-11-12 Deep Learning, NLP, and Representations
2024-11-05 Setting Your Pet Rock Free.
2024-08-14 Algorithms for Decision Making
2024-08-12 Evaluating ∇f(x) is as fast as f(x)
2024-08-12 Reinforcement Learning from Human Feedback Book
2024-08-12 Mathematical Foundations of Reinforcement Learning
2024-08-11 Animated AI
2024-08-07 Learning to Move with Affordance Maps
2024-08-07 Imitation Learning
2024-08-06 Algorithms for Modern Hardware
2024-08-05 bytecode interpreters for tiny computers
2024-08-03 Latency Numbers Every Engineer Should Know
2024-07-28 The Matrix Calculus You Need For Deep Learning
2024-07-28 A Recipe for Training Neural Networks
2024-07-28 Practical Deep Learning
2024-07-25 A (Long) Peer into Reinforcement Learning
2024-07-23 Open Source AI Is the Path Forward
2024-07-23 The Llama 3 Herd of Models
2024-07-21 Mamba: The Hard Way
2024-07-18 CleanRL (Clean Implementation of RL Algorithms)
2024-07-18 Policy Gradient Demystified
2024-07-17 Prover-Verifier Games improve legibility of language model outputs
2024-07-16 Codestral Mamba
2024-07-12 Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
2024-07-12 Modularized Implementation of Deep RL Algorithms in PyTorch
2024-07-10 ML Code Challenges
2024-07-10 lucidrains
2024-07-05 From Autoencoder to Beta-VAE
2024-07-05 Tutorial on Variational Autoencoders
2024-07-01 The Super Effectiveness of Pokémon Embeddings Using Only Raw JSON and Images
2024-07-01 Popular Model-free Reinforcement Learning Algorithms
2024-06-29 Auto-Encoding Variational Bayes
2024-06-29 Stanford CS234: Reinforcement Learning Spring 2024
2024-06-29 Policy Gradient Algorithms
2024-06-28 How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog
2024-06-27 Meta Large Language Model Compiler: Foundation Models of Compiler Optimization
2024-06-27 KAN: Kolmogorov-Arnold Networks
2024-06-26 Higher-order Virtual Machine 2
2024-06-26 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
The Wi-Fi only works when it's raining
Andrej Karpathy (OpenAI, Tesla)
Einops: Clear and Reliable Tensor Manipulations with Einstein-like Notation
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Formal Algorithms for Transformers
A high-bias, low-variance introduction to Machine Learning for physicists
Gradient-Based Learning Applied to Document Recognition
UGeneva 14x050 Deep Learning Course
The Transformer Family Version 2.0
UToronto CSC321 Lecture 10: Automatic Differentiation
The Matrix Calculus You Need For Deep Learning