AI Filters Will Always Have Holes
The Quanta Podcast
Jan 06
AI Filters Will Always Have Holes
AI Filters Will Always Have Holes

The Quanta Podcast
Jan 06
Unprocessed episode, you can be the first!
Shownote
Shownote
Ask ChatGPT how to build a bomb, and it will flatly respond that it “can’t help with that.” But users have long played a cat-and-mouse game to try to trick language models into providing forbidden information. Just as quickly as these “jailbreaks” appear, ...
Highlights
Highlights
Chapters
Chapters
Transcript
Transcript