Can Coding Agents Be General Agents?
View PDF HTML (experimental) Abstract:As coding agents have seen rapid capability and adoption gains, users are applying them to general tasks beyond software engineering. In this post, we investigate whether coding agents can successfully generalize to end-to-end business process automation. We identify gaps in current evaluations, and conduct a case study to evaluate a coding agent on practical business tasks in an open-core Enterprise Resource Planning system. We find that the agent reliably completes simple tasks but exhibits characteristic failures on complex tasks, suggesting that bridging domain logic and code execution is a key bottleneck to generalizability. Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG) Cite as: arXiv:2604.13107 [cs.SE] (or arXiv:2604.13107v1 [cs.SE] for this version) https://doi.org/10.48550/arXiv.2604.13107 arXiv-issued DOI via DataCite Submission history From: Gokul Prabhakaran [view email] [v1] Fri, 10 Apr 2026 22:39:51 UTC (732 KB)
No replies yet. Be first.