arxiv
PublishedApril 24, 2026 at 4:00 AM
—neutral
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models
Publisher summary· verbatim
arXiv:2509.24239v4 Announce Type: replace-cross Abstract: Recent large language models (LLMs) have shown strong reasoning capabilities. However, a critical question remains: do these models possess genuine strategic reasoning, or do they primarily excel at pattern recognition? To address this, we pr
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivConsequentialist Objectives and Catastrophe8harxivEgoMAGIC- An Egocentric Video Field Medicine Dataset for Training Perception Algorithms8harxivReCast: Recasting Learning Signals for Reinforcement Learning in Generative Recommendation8harxivA Probabilistic Framework for Hierarchical Goal Recognition8hOriginally published on arxiv ↗