arxiv
Published June 2, 2026 at 4:00 AM
When Single Answer Is Not Enough: Rethinking Single-Step Retrosynthesis Benchmarks for LLMs
Publisher summary (verbatim)
arXiv:2602.03554v2 Announce Type: replace-cross Abstract: Recent progress has expanded the use of large language models (LLMs) in drug discovery, including synthesis planning. However, objective evaluation of retrosynthesis performance remains limited. Existing benchmarks and metrics typically rely
Originally published on arxiv ↗