Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days
How I AI
Feb 11
Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days
Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

How I AI
Feb 11
Shownote
Shownote
I put the newest AI coding models from OpenAI and Anthropic head-to-head, testing them on real engineering work I’m actually doing. I compare GPT-5.3 Codex with Opus 4.6 (and Opus 4.6 Fast) by asking them to redesign my marketing website and refactor some ...
Highlights
Highlights
In this episode, Claire Vo puts the latest AI coding models—OpenAI’s GPT-5.3 Codex and Anthropic’s Claude Opus 4.6 (including the faster variant)—to the test on real engineering tasks she’s actively shipping.
Chapters
Chapters
Introduction to new AI coding models
00:00My test methodology for comparing models
02:13Codex’s unique features: Git primitives, skills, and automations
03:30Testing GPT-5.2 Codex on a website redesign task
09:05Challenges with Codex’s literal interpretation of prompts
10:40Comparing the before and after with Codex
15:00Testing Opus 4.6 on the same website redesign task
16:23Comparing the visual results of both models
20:56Real-world engineering impact: 44 PRs in five days
21:30Refactoring components with Opus 4.6
23:03Using Codex for code review and architectural analysis
24:30Cost considerations for Opus 4.6 Fast
26:55Conclusion
28:52Transcript
Transcript
Claire Vo: Welcome back to How I AI. I'm Claire Vo, product leader and AI obsessive, here on a mission to help you build better with these new tools. Today, we're going to bring you up to date on all the new coding model releases from OpenAI and Anthropic....