Inside Anthropic: How Claude Actually Gets Built | Alex Albert

Behind the Craft

3 DAYS AGO


In this podcast, Alex Albert, a research PM at Anthropic, provides an inside look at how the team develops the next Claude model, covering everything from product development and user feedback integration to character training and consciousness research.

Anthropic treats each new Claude model as a product, setting requirements early and discovering capabilities during training. The team uses user feedback, analyzed by Claude itself, to refine features such as adaptive thinking and memory; the memory feature includes a 'dreaming' process that prunes memories. Decision-making centers on identifying irreversible 'one-way doors' that warrant careful deliberation, while reversible decisions are treated as cheap. Internally, Claude is used for coding, data access, and brainstorming. Evaluating models involves creating test cases that expose weaknesses and collaborating on interventions such as pre-training or RL. Character training combines quantifiable metrics with human intuition drawn from transcripts. Anthropic also researches Claude's consciousness to improve interactions and trustworthiness, without taking an official stance.