scripod.com
AI Filters Will Always Have Holes

Shownote

Ask ChatGPT how to build a bomb, and it will flatly respond that it “can’t help with that.” But users have long played a cat-and-mouse game to try to trick language models into providing forbidden information. Just as quickly as these “jailbreaks” appear, ...
