arxiv
PublishedMay 28, 2026 at 4:00 AM
▲bullish
A Benchmark Construction and Evaluation Framework for Specialist Domains: Case Study on Defense-related Documents
Publisher summary· verbatim
arXiv:2604.17943v2 Announce Type: replace Abstract: RAG-based question-answering (QA) in specialist domains faces a cold-start problem: lack of evaluative benchmarks and absence of labeled data for post-training. We present DoRA (Domain-oriented RAG Assessment), a novel benchmark construction and ev
Models mentioned
01Related
04- arxiv12dGraphRAG on Consumer Hardware: Benchmarking Local LLMs for Healthcare EHR Schema Retrieval
- arxiv12dDrugRAG: Enhancing Pharmacy LLM Performance Through A Novel Retrieval-Augmented Generation Pipeline
- arxiv26dMeasuring Evaluation-Context Divergence in Open-Weight LLMs: A Paired-Prompt Protocol with Pilot Evidence of Alignment-Pipeline-Specific Heterogeneity
- arxivApr 4Countering Catastrophic Forgetting of Large Language Models for Better Instruction Following via Weight-Space Model Merging
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivFederatedSkill: Federated Learning for Agentic Skill Evolution9harxivToward a Modular Architecture for Embedded AI Agent Systems at the Edge9harxivA Graph Foundation Model with Spectral Parsing and Prototype-Guided Spatial Propagation9harxivAnomalies in Multivariate Time Series Benchmarks Are Mostly Univariate9hThe Bubble Brief
WEEKLYRead benchmark insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗