What I Read: Transformers by Hand.
https://towardsdatascience.com/deep-dive-into-transformers-by-hand-%EF%B8%8E-68b8be4bd813?gi=b2b3c1885179
Deep Dive into Transformers by Hand
Srijanie Dey, PhD
Apr 12, 2024
“…the two mechanisms that are truly the force behind the transformers are attention weighting and feed-forward networks (FFN).”