What I Read: gradient descent.
https://centralflows.github.io/part1/
Part I: how does gradient descent work?
Jeremy Cohen, Alex Damian, Ameet Talwalkar, J. Zico Kolter, Jason D. Lee
25 Sep 2025
"Perhaps surprisingly, traditional analyses of gradient descent cannot capture the typical dynamics of gradient descent in deep learning. We'll first explain why..."