The Shape of Compute (Chris Lattner of Modular)
Latent Space: The AI Engineer Podcast
2025/06/13
Shownote
Chris Lattner of Modular (https://modular.com) joined us (again!) to talk about how they are breaking the CUDA monopoly, what it took to match NVIDIA performance with AMD, and how they are building a company of "elite nerds".
X: https://x.com/latentspacepod
Substack: https://latent.space
00:00:00 Introductions
00:00:12 Overview of Modular and the Shape of Compute
00:02:27 Modular’s R&D Phase
00:06:55 From CPU Optimization to GPU Support
00:11:14 MAX: Modular’s Inference Framework
00:12:52 Mojo Programming Language
00:18:25 MAX Architecture: From Mojo to Cluster-Scale Inference
00:29:16 Open Source Contributions and Community Involvement
00:32:25 Modular's Differentiation from VLLM and SGLang
00:41:37 Modular’s Business Model and Monetization Strategy
00:53:17 DeepSeek’s Impact and Low-Level GPU Programming
01:00:00 Inference Time Compute and Reasoning Models
01:02:31 Personal Reflections on Leading Modular
01:08:27 Daily Routine and Time Management as a Founder
01:13:24 Using AI Coding Tools and Staying Current with Research
01:14:47 Personal Projects and Work-Life Balance
01:17:05 Hiring, Open Source, and Community Engagement
Highlights
In this episode, Chris Lattner of Modular discusses the company's efforts to revolutionize GPU programming and democratize AI. The conversation explores how Modular is breaking CUDA's monopoly, achieving high performance on AMD GPUs, and fostering a community of 'elite nerds' through open-source contributions and innovative technologies.
Chapters
00:00 Introductions
00:12 Overview of Modular and the Shape of Compute
02:27 Modular’s R&D Phase
06:55 From CPU Optimization to GPU Support
11:14 MAX: Modular’s Inference Framework
12:52 Mojo Programming Language
18:25 MAX Architecture: From Mojo to Cluster-Scale Inference
29:16 Open Source Contributions and Community Involvement
32:25 Modular's Differentiation from VLLM and SGLang
41:37 Modular’s Business Model and Monetization Strategy
53:17 DeepSeek’s Impact and Low-Level GPU Programming
1:00:00 Inference Time Compute and Reasoning Models
1:02:31 Personal Reflections on Leading Modular
1:08:27 Daily Routine and Time Management as a Founder
1:13:24 Using AI Coding Tools and Staying Current with Research
1:14:47 Personal Projects and Work-Life Balance
1:17:05 Hiring, Open Source, and Community Engagement
Transcript
Alessio: Hey everyone, welcome to the Latent Space Podcast. This is Alessio, partner and CTO at Decibel, and I'm joined by my co-host, this week's founder of Modular. And we're so excited to be back in the studio with Chris Lattner from Modular. Welcome ba...