Jailbreaking Multimodal Large Language Models using Multi-Clip Video

Source

arxiv.orgfull article ↗

Read on arxiv

Publisher summary· verbatim

arXiv:2606.02111v1 Announce Type: cross Abstract: As multimodal large language models (MLLMs) have advanced to process video inputs, concerns have emerged about their potential for malicious misuse. Prior jailbreak studies have shown that safety alignment in MLLMs can be bypassed through visual inpu

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

Discussion

No replies yet. Be first.

Jailbreaking Multimodal Large Language Models using Multi-Clip Video

Related coverage

Jailbreaking Multimodal Large Language Models using Multi-Clip Video

Related coverage