arXiv
Published April 27, 2026 at 4:00 AM
Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression
Publisher summary · verbatim
arXiv:2505.13527v4 Announce Type: replace-cross Abstract: Despite substantial advancements in aligning large language models (LLMs) with human values, current safety mechanisms remain susceptible to jailbreak attacks. We hypothesize that this vulnerability stems from distributional discrepancies between …