Andrew Fairless, Ph.D.

2025-07-29
What I Read: Scaling RL

2025-07-15
What I Read: reasoning research

2025-07-09
What I Read: RL Traffic Smoothing

2025-06-12
What I Read: reinforcement learning

2025-05-26
What I Read: RL, PPO, GRPO

2025-05-22
What I Read: Group relative policy optimization

2025-04-01
What I Read: reward hacking

2024-10-10
What I Read: Hidden Infinity, Preference Learning

2023-10-30
What I Read: LLM Training, RLHF

2023-09-07
What I Read: LLMs