Andrew Fairless, Ph.D.

Andrew Fairless, Ph.D.

blue-green banner

2026-01-14

What I Read: History, Large Language Models
2025-10-22

What I Read: Dynamic Inventory Optimization
2025-08-21

What I Read: Data Quality
2025-07-29

What I Read: Scaling RL
2025-07-15

What I Read: reasoning research
2025-07-09

What I Read: RL Traffic Smoothing
2025-06-23

What I Read: LLM Agents
2025-06-19

What I Read: LLM Reasoning
2025-06-12

What I Read: reinforcement learning
2025-06-03

What I Read: Model, Product
2025-05-26

What I Read: RL, PPO, GRPO
2025-05-22

What I Read: Group relative policy optimization
2025-05-21

What I Read: reasoning LLMs
2025-05-15

What I Read: next weak learners
2025-04-30

What I Read: adaptive LLM
2025-04-01

What I Read: reward hacking
2024-12-04

What I Read: passively learned, causality
2024-11-18

What I Read: LLM Pre-training Post-training
2024-11-13

What I Read: Open-endedness, Agentic AI
2024-11-05

What I Read: Contextual Bandit, LinUCB
2024-10-10

What I Read: Hidden Infinity, Preference Learning
2024-10-07

What I Read: Extrinsic Hallucinations, LLMs
2024-09-04

What I Read: LLMs train LLMs
2024-09-03

What I Read: Summarization, LLMs
2024-05-08

State-Space Models: Learning the Kalman Filter

Different research fields may speak different mathematical languages. There’s nothing like rigorous software testing for accurate translation.
Read more ⟶
2024-02-19

What I Read: Instruction Tuning
2024-02-15

What I Read: Will Scaling Solve Robotics?
2023-12-12

What I Read: AI System Beats Chess Puzzles
2023-10-30

What I Read: LLM Training, RLHF
2023-09-07

What I Read: LLMs
2023-08-17

What I Read: LLM Agents
2023-07-10

What I Read: AIs producing own training data
2023-06-27

What I Read: Reinforcement Learning from Human Feedback
2023-06-21

What I Read: Reinforcement Learning, Language Models
2023-05-23

What I Read: human touch, LLMs
2023-05-09

What I Read: Competitive Machine Learning
2023-04-06

What I Read: Teach Computers Math
2023-03-21

What I Read: Machines Learn, Teach Basics
2023-02-22

What I Read: AI, Human Values
2023-02-21

What I Read: Offline RL, Large Language Models
2023-02-06

What I Read: Causal Confounds, Sequential Decision
2023-01-30

What I Read: Matrix Multiplication
2023-01-17

What I Read: Learning to Imitate
2022-12-22

What I Read: The Farama Foundation
2022-12-21

What I Read: Pre-Trained Models, Robotics
2022-12-20

What I Read: undesired goals
2022-07-27

What I Read: Against Naive AI Scaling
2022-07-20

What I Read: What is Reinforcement Learning
2022-07-14

What I Read: Exploring Virtual Worlds, AI
2022-05-11

What I Read: Policy Regulariser, Adversary
2022-02-16

What I Read: To Understand Language is to Understand Generalization
2022-02-08

What I Read: How to Train Decision-Making AIs
2022-01-10

What I Read: Neural-Control Family
2021-12-01

What I Read: Autonomous Building of Composable Models
2021-10-11

What I Learn: Robots Must Be Ephemeralized
2021-10-05

What I Read: Permutation-Invariant Neural Networks for Reinforcement Learning
2021-09-23

What I Read: How Generally Capable Agents Trained
2021-09-04

Deep reinforcement learning and Rainbow

How does a computer learn to play video games?
Read more ⟶
2021-08-19

What I Read: AI-Generating Algorithms, Evolutionary RL
2021-07-26

What I Read: Model Free
2021-07-07

What I Read: Five types of thinking
2021-05-25

What I Read: RL, Decentralized Multi-agent Navigation
2021-03-22

What I Read: Continual Learning, Amnesia, Neural Networks
2021-03-11

What I Read: Neural Text Generation
2021-02-20

What I Read: Interpretability in Machine Learning
2021-02-15

What I Read: Revisiting Sutton’s Bitter Lesson for AI
2021-02-02

What I Read: Reinforcement learning is supervised learning
2021-01-26

What I Read: Neural Architecture Search
2021-01-23

What I Read: Multi-Armed Bandits and Experimentation
2021-01-23

What I Read: Traffic prediction with Graph Neural Networks
2020-12-09

What I Read: Artificial Intelligence Will Do What We Ask