arxiv
PublishedJune 5, 2026 at 4:00 AM
—neutral
Do LLMs Hold Their Values? MANTA: A Multi-Turn Adversarial Benchmark for Animal Welfare Reasoning
Publisher summary· verbatim
arXiv:2605.16301v2 Announce Type: replace-cross Abstract: Evaluating animal welfare reasoning in LLMs remains an open challenge despite rapid deployment in consumer and professional contexts where welfare considerations appear implicitly in everyday queries. Existing benchmarks such as AnimalHarmBen
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivSFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning8harxivOptical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning8harxivDynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models8harxivTemporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents8hThe Bubble Brief
WEEKLYRead AI insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗