Andrew Fairless, Ph.D.
Entries tagged :: training
2025-08-14
What I Read: Scale
2025-07-15
What I Read: reasoning research
2025-05-26
What I Read: RL, PPO, GRPO
2025-05-22
What I Read: Group relative policy optimization
2025-05-01
What I Read: memorization, novelty
2025-04-14
What I Read: Age of Data
2025-02-26
What I Read: benchmark
2024-12-19
What I Read: Toy Models, Superposition?
2024-10-30
What I Read: Visual Guide, Quantization
2024-09-25
What I Read: bare metal to 70B
2024-06-25
What I Read: How Machines ‘Grok’ Data
2024-06-12
What I Read: Data Selection, LLMs
2024-05-28
What I Read: Cloud native data loaders ML
2024-04-23
What I Read: Diffusion Model theory
2024-04-18
What I Read: Scaling ChatGPT, Engineering Challenges
2024-04-17
What I Read: High-Quality Human Data
2024-02-05
What I Read: Finetuning LLMs Using LoRA
2024-01-29
What I Read: Gaussian Processes Extrapolate
2024-01-18
What I Read: Unify Batch and ML Systems
2024-01-03
What I Read: Finetuning LLMs with LoRA and QLoRA
2023-11-14
What I Read: Auditing AI, How Much Access
2023-10-25
What I Read: Nvidia AI Supremacy Temporary
2023-10-23
What I Read: LLMs, single example
2023-10-02
What I Read: Models Memorize or Generalize?
2023-09-28
What I Read: scaling laws, cross-entropy loss
2023-09-13
What I Read: Attention Off By One
2023-09-06
What I Read: Accelerating PyTorch
2023-07-10
What I Read: AIs producing own training data
2023-03-15
What I Read: Optimizing Machine Learning Training Pipelines
2023-01-19
What I Read: Transformers Training
2022-09-12
What I Read: BLOOM Training
2022-03-30
What I Read: Researchers Build AI That Builds AI
2021-11-08
What I Read: How Train Large Deep Learning Models
2021-10-26
What I Read: How to Train Really Large Models
2021-05-12
What I Read: Continuous Training Strategy
2021-02-07
What I Read: Reproducing Deep Double Descent
2021-02-06
What I Read: Deep Double Descent: Where Bigger Models and More Data Hurt
2021-01-17
Case Study: How to Translate a Healthcare Problem into a Predictive Modeling Problem
How do we correctly select cases for our training data?
2020-11-29
What I Read: The Ultimate Guide to Model Retraining