scripod.com

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

This podcast delves into the intersection of AI and strategic games, focusing on advancements in Diplomacy AI, reasoning models, and multi-agent systems. The discussion highlights how AI has influenced human gameplay strategies, particularly through insights from the development of Cicero, which contributed to a world championship win. It also examines the challenges and opportunities in creating AI capable of passing the Turing test and achieving success in complex environments.
The podcast explores the impact of AI on strategic games, emphasizing Diplomacy and poker. Insights from Cicero's development improved human gameplay, showcasing AI's potential beyond direct competition. The conversation addresses AI safety, controllability, and the evolution of the o-series models, which challenge misconceptions about whether reasoning models apply in subjective domains. The System 1/System 2 paradigm is applied to AI, highlighting how strong intuitive capabilities underpin deliberate reasoning.

Test-time compute and reinforcement fine-tuning are discussed as methods for improving model adaptability and specialization. OpenAI's journey with reasoning models reflects an early bet on scaling paradigms, improving data efficiency and, in places, surpassing human capabilities.

Multi-agent research focuses on cooperative and competitive AI systems, with the speaker advocating principled approaches over heuristics. In poker, game-theory-optimal (GTO) strategies are contrasted with exploitative ones, revealing how hard it is to model opponents effectively. World modeling and self-play face limitations outside two-player zero-sum games, necessitating new frameworks. Finally, advances in generative media and robotics highlight diverse AI applications, while benchmarks and societal impacts underscore ongoing research directions.
00:39
Building Cicero, an AI for Diplomacy, in 2022 improved the speaker's own gameplay.
03:51
The AI safety community valued Cicero's controllability.
07:57
Models can perform well in subjective domains when success can be measured.
11:28
GPT-4.5 can play tic-tac-toe reasonably well but may need System 2 for perfect play.
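One way to make the System 2 point concrete: perfect tic-tac-toe play is attainable with explicit search rather than pattern-matched intuition. A minimal minimax sketch (an illustration of the technique, not code from the episode):

```python
# Minimax for tic-tac-toe: exhaustive "System 2" search that guarantees
# perfect play, in contrast to a model's pattern-matched "System 1" moves.

def winner(board):
    """Return 'X' or 'O' if someone has three in a row, else None."""
    lines = [(0,1,2),(3,4,5),(6,7,8),(0,3,6),(1,4,7),(2,5,8),(0,4,8),(2,4,6)]
    for a, b, c in lines:
        if board[a] != ' ' and board[a] == board[b] == board[c]:
            return board[a]
    return None

def minimax(board, player):
    """Return (score, best_move) from `player`'s view: +1 win, 0 draw, -1 loss."""
    w = winner(board)
    if w is not None:
        return (1 if w == player else -1), None
    moves = [i for i, cell in enumerate(board) if cell == ' ']
    if not moves:
        return 0, None  # board full: draw
    opponent = 'O' if player == 'X' else 'X'
    best_score, best_move = -2, None
    for m in moves:
        board[m] = player
        score, _ = minimax(board, opponent)
        board[m] = ' '
        score = -score  # negamax: opponent's score negated is ours
        if score > best_score:
            best_score, best_move = score, m
    return best_score, best_move

score, move = minimax([' '] * 9, 'X')
print(score)  # 0: tic-tac-toe is a draw under perfect play
```

The search is tiny here (a few hundred thousand nodes), which is why explicit deliberation solves tic-tac-toe exactly while remaining intractable for games like Go without learned intuition to guide it.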
16:27
Model routers may become unnecessary as models advance.
20:36
Reinforcement fine-tuning specializes models for specific data.
24:07
OpenAI's success came from recognizing the potential of reasoning models and investing in scaling up.
32:04
OpenAI's success stemmed from betting on the scaling paradigm early.
40:19
Well-aligned AI models could outperform humans in virtual assistant roles.
44:20
The multi-agent field has been led astray by heuristic approaches.
49:55
Current poker AIs mainly follow pre-computed GTO strategies and lack effective adaptation to opponents.
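Regret matching is the core update behind the counterfactual-regret solvers used to pre-compute poker equilibria; run against a fixed, biased opponent, the same update instead converges to an exploitative best response. A toy rock-paper-scissors sketch (my illustration of the technique; the setup is an assumption, not from the episode):

```python
# Regret matching (Hart & Mas-Colell), the building block of CFR-style poker
# solvers. Against a fixed, rock-heavy opponent it converges to the
# exploitative best response (always paper), not the GTO uniform mixture.

PAYOFF = [
    [0, -1, 1],   # rock     vs rock, paper, scissors
    [1, 0, -1],   # paper
    [-1, 1, 0],   # scissors
]

def strategy_from_regrets(regrets):
    """Play each action in proportion to its positive cumulative regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    return [p / total for p in positive] if total > 0 else [1/3, 1/3, 1/3]

def best_response_by_regret_matching(opponent, iterations=10_000):
    regrets = [0.0, 0.0, 0.0]
    strategy_sum = [0.0, 0.0, 0.0]
    for _ in range(iterations):
        strat = strategy_from_regrets(regrets)
        for a in range(3):
            strategy_sum[a] += strat[a]
        # Expected value of each action, and of the current mixed strategy,
        # against the opponent's fixed action distribution.
        values = [sum(PAYOFF[a][o] * opponent[o] for o in range(3)) for a in range(3)]
        played = sum(strat[a] * values[a] for a in range(3))
        for a in range(3):
            regrets[a] += values[a] - played  # regret for not having played a
    total = sum(strategy_sum)
    return [s / total for s in strategy_sum]  # average strategy over all rounds

avg = best_response_by_regret_matching([0.6, 0.2, 0.2])  # rock-heavy opponent
print(avg[1] > 0.99)  # True: nearly all weight on paper
```

With two regret matchers playing each other in this zero-sum game, the same update drives the average strategies toward the uniform GTO mixture — the "pre-computed equilibrium" regime the chapter describes.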
54:52
AlphaGo's success stems from large-scale pre-training and self-play.
58:53
Generative media attracts more public attention and drives subscriptions.
1:00:46
Robotics progress is slower due to difficulty iterating on physical hardware.
1:06:51
Current benchmarks for AI models focus on easily measurable problems.
1:16:51
General reasoning techniques are more valuable than poker-specific approaches.