Michael Yu
ML paper summaries
Summaries of, and personal thoughts on, papers (mostly in machine learning) that I have read in detail.
(Apr 18, 2026) Training Compute-Optimal Large Language Models
(Apr 18, 2026) Learning and Leveraging World Models in Visual Representation Learning
(Apr 2, 2026) FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness