A Benchmark Construction and Evaluation Framework for Specialist Domains: Case Study on Defense-related Documents

Source

arxiv.orgfull article ↗

Read on arxiv

Publisher summary· verbatim

arXiv:2604.17943v2 Announce Type: replace Abstract: RAG-based question-answering (QA) in specialist domains faces a cold-start problem: lack of evaluative benchmarks and absence of labeled data for post-training. We present DoRA (Domain-oriented RAG Assessment), a novel benchmark construction and ev

Models mentioned

01
Llama-3.1-8B
meta-llama/Llama-3.1-8B
DL 1.5M0.0%IN $0.10/Mtok

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

Discussion

No replies yet. Be first.

A Benchmark Construction and Evaluation Framework for Specialist Domains: Case Study on Defense-related Documents

Related coverage

A Benchmark Construction and Evaluation Framework for Specialist Domains: Case Study on Defense-related Documents

Related coverage