Catching AI Sleeper Agent - LLM Backdoors
Build Wiz AI Show
Feb 05

Shownote
Could your trusted AI model be a hidden "sleeper agent" just waiting for a secret command to turn malicious? We explore a new methodology that extracts and reconstructs backdoor triggers by exploiting the surprising fact that these models often strongly me...