Xiao Zhang
Xiao Zhang
About
Research
Publication
Teaching
Service
Contact
Light
Dark
Automatic
Jailbreak Attack
Oct 7, 2025
GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs
We introduce Generative Adversarial Suffix Prompter (GASP), a novel framework that combines human-readable prompt generation with Latent Bayesian Optimization (LBO) to improve adversarial suffix creation in a fully black-box setting.
Advik Raj Basani
,
Xiao Zhang
PDF
Cite
Code
ArXiv
OpenReview
Cite
×