blue-green banner

https://gr.inc/blog/scaling-rl-compute/
Scaling RL Compute
General Reasoning
March 21, 2025
“Training language models with the right objective allows them to learn that using more inference compute is beneficial for performance. This behavior, known as inference-time scaling, emerges with sufficient RL compute…”