scripod.com

#244 - GPT-5.5 Instant, Grok 4.3, OpenAI vs Musk

Overview

Shownote

Highlights

Transcript

Chapters

Pins

#244 - GPT-5.5 Instant, Grok 4.3, OpenAI vs Musk

Last Week in AI

May 11

#244 - GPT-5.5 Instant, Grok 4.3, OpenAI vs Musk

#244 - GPT-5.5 Instant, Grok 4.3, OpenAI vs Musk

Last Week in AI

Last Week in AI

May 11

Overview Shownote Highlights Transcript Chapters Pins

This episode delivers a comprehensive, critical analysis of the most consequential AI developments from the past week—spanning model releases, safety incidents, industry shifts, policy evolution, and foundational research advances.

The podcast unpacks OpenAI’s GPT-5.5 Instant launch—its benchmark gains, high cyber-risk classification, and the surprising 'goblin' obsession traced to reward shaping in RL training. It contrasts xAI’s pragmatic Grok 4.3 rollout with Mistral’s unified Medium 3.5 and Anthropic’s enhanced Managed Agents featuring Dreaming, Outcomes, and multi-agent orchestration. Key business developments include Anthropic’s $900B valuation talks, its compute deal with SpaceX, and the Anthropic–FIS financial crime agent. On policy, the Pentagon’s classified AI deployments, expanded U.S. government pre-release reviews, and the NSA’s active testing of Anthropic’s Mythos on Microsoft software signal a national-security-driven regulatory shift. Research highlights include Natural Language Autoencoders for interpretable activation explanations, Introspection Adapters for self-reporting behaviors, and recursive multi-agent systems enabling efficient, latent-space reasoning—culminating in frontier coding agents that autonomously implement AlphaZero-style self-play pipelines. Throughout, the hosts emphasize systemic fragility, opaque feedback loops, and the urgent need for adaptive oversight.

01:15

01:15

Today's AI agents operate in silos

04:40

04:40

Mythos led to Firefox fixing a large number of bugs

13:18

13:18

GPT-5.5 Instant outperforms GPT-5.4 on cyber-related benchmarks despite being lighter-weight

17:58

17:58

The 'goblin' obsession originated from over-rewarding creature metaphors in a nerdy model variant during RL training

27:00

27:00

Grok 4.3 sits on the Pareto frontier of cost per unit intelligence

30:51

30:51

XAI's Grok 5 is planned for later this year and targets 10 trillion parameters

39:27

39:27

Multi-agent orchestration enables a lead agent to delegate tasks to sub-agents

40:43

40:43

When demand is low, spare AI infrastructure capacity is used for 'dreaming' to keep GPUs active and optimize the compute fleet.

51:00

51:00

Greg Brockman's diary showed a focus on reaching $1 billion net worth, which contradicts his court statement of putting mission first

55:45

55:45

SpaceX provides compute to Anthropic, potentially transforming xAI into an AI cloud provider

58:48

58:48

Anthropic shows interest in SpaceX's orbital data center plan for its IPO

59:42

59:42

Anthropic is in talks to raise funds at a $900 billion valuation, a big jump from February

1:03:16

1:03:16

Asset managers can force AI tool usage in portfolio companies due to PE ownership

1:04:04

1:04:04

Forward-deployed engineers act as an accelerant to AI adoption, differing from self-serve API models

1:08:45

1:08:45

AI firms accounted for over a third of private-graded deals in 2025, up from 17% a few years ago

1:11:17

1:11:17

Risk doesn't disappear but migrates to private credit funds, insurers, and asset-backed markets

1:15:48

1:15:48

Anthropic's Natural Language Autoencoders map LLM activations directly to plain English explanations

1:19:23

1:19:23

Natural Language Autoencoders encode activation into plain English and decode it back, trained via reinforcement learning to minimize reconstruction error

1:22:06

1:22:06

The technology is already in production across OpenAI's superclusters.

1:27:26

1:27:26

The US government is considering pre-deployment licensing for AI models like Mythos

1:31:55

1:31:55

NSA is examining Anthropic's Mythos to find zero-day vulnerabilities

1:35:33

1:35:33

680 behavior adapters are trained for different proclivities, then introspection LoRa adapters teach the model to report those behaviors

1:40:59

1:40:59

Passing activations instead of text preserves chain-of-thought integrity in multi-agent systems

1:48:47

1:48:47

Frontier coding agents can implement an AlphaZero self-play pipeline for Connect Four under non-ML expert human direction