scripod.com

Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

How I AI

2025/12/03
How I AI

How I AI

2025/12/03

Shownote

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design qualit...

Highlights

In this episode, a direct comparison is made between three advanced AI models—Gemini 3 Pro, Opus 4.5, and Codex 5.1—tasked with redesigning a blog page from the same prompt. The focus is on evaluating their real-world performance in design quality, user experience, and technical execution.
06:15
Opus 4.5 created a to-do list and executed the redesign step-by-step with superior planning.
16:03
Gemini 3 Pro added related articles and SEO schema across blog posts
23:00
Opus 4.5 excels as the best designer in design, usability, and SEO

Chapters

Introduction to the AI design challenge
00:00
The question: Which model is the better designer?
01:25
The prompt used for all three models
03:08
Gemini 3 Pro’s approach and results
04:10
Opus 4.5’s approach and results
06:00
Codex 5.1’s approach and disappointing results
10:54
Comparing the three designs side by side
14:51
Analyzing the change logs and SEO improvements from each model
16:03
Final verdict
22:43
Conclusion and next steps
23:00

Transcript

Claire Vo: Welcome back to How I AI. I'm Claire Vo, product leader and AI obsessive, here on a mission to help you build better with these new tools. Today, I have a really fun mini episode where I'm going to answer the question on everyone's mind. Which o...