After LLMs: Spatial Intelligence and World Models — Fei-Fei Li & Justin Johnson, World Labs

Latent Space: The AI Engineer Podcast

2025/11/25

The conversation dives into the evolution of AI beyond language, exploring how spatial intelligence is reshaping the way machines understand and generate 3D environments. From foundational research to real-world applications, the discussion bridges decades of computer vision progress with a bold new direction in generative modeling.

The podcast explores the emergence of spatial intelligence as a critical frontier in AI, driven by World Labs' new generative world model, Marble. Building on early breakthroughs like ImageNet and dense captioning, the team highlights how visual and physical understanding go beyond the limits of language-based models. Marble uses Gaussian splats to create interactive, editable 3D scenes from text or images, enabling precise camera control and cross-device rendering. The discussion emphasizes that while current models excel at pattern recognition, they lack true causal reasoning (for example, deriving physical laws like F=ma), which points to the need for embodied, multimodal systems. Spatial intelligence, rooted in perception and interaction, is framed as complementary to language, not a replacement for it. Applications span creative fields like film and design, robotics simulation, and scientific discovery. The vision centers on integrating physics into neural models and democratizing access to world-scale AI, and it urges researchers to pursue high-risk, long-term innovation despite resource disparities between academia and industry.