@ZhanQiusi1 will be presenting our work at the Wednesday 11AM poster session and the Saturday TrustNLP workshop (spotlight talk)! Say hi if you see her
Daniel Kang
Daniel Kang13.3.2025
AI agents are increasingly popular (e.g., OpenAI's operator) but can be attacked to harm users! We show that even with defenses, AI agents can still be compromised via indirect prompt injections via "adaptive attacks" in our NAACL 2025 findings paper 🧵 and links below
167