GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Theory

Source

arxiv.org

Publisher summary (verbatim)

arXiv:2602.12316v2. Abstract: Frontier AI systems are increasingly capable and deployed in high-stakes multi-agent environments. However, existing AI safety benchmarks largely evaluate single agents, leaving multi-agent risks such as coordination failure and conflict poorly und
