scripod.com

Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Andrej Karpathy discusses his evolving relationship with AI, from delegating most of his coding to agents to exploring autonomous research. He shares insights on the current limits of AI, the future of engineering, and the potential impact on jobs and education.
Karpathy describes a shift from being compute-bound to token-bound, where maximizing token throughput is key. He envisions mastery of coding agents through multi-agent collaboration and persistent systems, exemplified by his home automation agent, Dobby. He introduces AutoResearch, a project to automate AI research loops, and discusses the concept of 'Program.md' for meta-optimization. He notes that AI excels at verifiable tasks but struggles with nuanced ones like humor, highlighting a jagged intelligence landscape. He predicts model speciation may occur due to compute constraints, and explores extending AutoResearch to a decentralized platform for parallel collaboration. On jobs, he suggests cheaper software may increase demand via Jevons paradox. He argues open-source models are only 6-8 months behind closed-source, which will reduce systemic risks. He predicts physical robotics will lag behind digital AI, and introduces MicroGPT for educational purposes, noting a shift where documentation will be written for agents, not humans.
00:00
00:00
Delegating most coding to AI agents
04:54
04:54
Shift from compute-bound to token-bound
10:19
10:19
AI agent controls lights, HVAC, shades, pool, spa, and security
13:29
13:29
AI agents can replace many custom apps
17:58
17:58
AI agents autonomously tuned hyperparameters better than he could manually
25:23
25:23
AI struggles with nuanced requests like humor.
28:33
28:33
Performance varies sharply across tasks.
35:00
35:00
Verification is cheap but generation is expensive.
44:15
44:15
Outside labs, one can be a free agent, aligned with humanity.
50:59
50:59
Centralization of intelligence carries systemic risks.
53:54
53:54
Physical robotics will lag behind digital AI.
1:03:12
1:03:12
Documentation will be written for agents rather than humans
1:05:40
1:05:40
Focus on what agents cannot do