arxiv
Published April 24, 2026 at 4:00 AM
Coverage, Not Averages: Semantic Stratification for Trustworthy Retrieval Evaluation
Publisher summary (verbatim)
arXiv:2604.20763v1 Announce Type: cross Abstract: Retrieval quality is the primary bottleneck for accuracy and robustness in retrieval-augmented generation (RAG). Current evaluation relies on heuristically constructed query sets, which introduce a hidden intrinsic bias. We formalize retrieval evalua
Originally published on arXiv