Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

How I AI

2025/12/03

Overview Shownote Highlights Transcript Chapters Pins

Shownote

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design qualit...

Highlights

In this episode, a direct comparison is made between three advanced AI models—Gemini 3 Pro, Opus 4.5, and Codex 5.1—tasked with redesigning a blog page from the same prompt. The focus is on evaluating their real-world performance in design quality, user experience, and technical execution.

06:15

Opus 4.5 created a to-do list and executed the redesign step-by-step with superior planning.

16:03

Gemini 3 Pro added related articles and SEO schema across blog posts

23:00

Opus 4.5 excels as the best designer in design, usability, and SEO

Chapters

Introduction to the AI design challenge