News
12 hours ago
Why Measuring Time is Not Enough: a Practical Roofline Model for ML Training
When we want to speed up training, the first instinct is to measure time and start optimizing the slowest kernel. But raw measurem...