CoP: Agentic Red-teaming for Large Language Models using Composition of PrinciplesChen XiongPin-Yu Chenet al.2025NeurIPS 2025
Defensive Prompt Patch: A Robust and Generalizable Defense of Large Language Models against Jailbreak AttacksChen XiongXiangyu Qiet al.2025ACL 2025