Michael Yu
scaling-laws
an archive of posts with this tag
Apr 18, 2026
Training Compute-Optimal Large Language Models