Thonk From First Principles
Subscribe
Sign in
Home
Archive
About
Why PyTorch is an amazing place to work... and Why I'm Joining Thinking Machines
In which I convince to you to join either PyTorch or Thinking Machines!
Mar 4, 2025
•
Horace He
81
7
5
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention [external]
Freeing users from the software lottery tyranny of fused attention implementations.
Aug 7, 2024
•
Horace He
20
2
2
Strangely, Matrix Multiplications on GPUs Run Faster When Given "Predictable" Data! [short]
Great minds discuss flops per watt.
Apr 29, 2024
•
Horace He
140
21
8
Solutions: What Shapes Do Matrix Multiplications Like?
Companion to https://www.thonking.ai/p/what-shapes-do-matrix-multiplications
Apr 8, 2024
•
Horace He
13
1
What Shapes Do Matrix Multiplications Like? [medium]
Divining order from the chaos
Apr 1, 2024
•
Horace He
91
2
3
Supporting Mixtral in gpt-fast through torch.compile [short]
Long-form version of this tweet thread: https://twitter.com/cHHillee/status/1762269069351461196
Feb 26, 2024
•
Horace He
and
Yanbo Liang
11
4
Thonk From First Principles
ML Systems from first principles. Aims to be better than a ChatGPT summary.
Subscribe
Recommendations
SemiAnalysis
Dylan Patel
Artificial Fintelligence
Finbarr Timbers
Thonk From First Principles
Subscribe
About
Archive
Recommendations
Sitemap
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts