arxivJun 30

Agentic Tool Use in Large Language Models

arXiv:2604.00835v2 Announce Type: replace Abstract: Large language models are increasingly being deployed as autonomous agents yet their real world effectiveness depends on reliable tools for information retrieval, computation and external action. Existing studies remain fragmented across tasks, too

#language-models #information-retrieval #computation Read on arxiv →

arxivJun 18

Which Sections of a Research Paper Best Reveal Its Research Methods? Evidence from Library and Information Science

arXiv:2606.19051v1 Announce Type: new Abstract: Research methods are essential carriers of knowledge contribution in academic papers. Automatic multi-label classification of research methods can support knowledge services such as method retrieval, review generation, and research intelligence analysi

#academic-papers #information-retrieval #classification Read on arxiv →

arxivJun 12bullish

NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track

arXiv:2606.11199v1 Announce Type: cross Abstract: We present NightFeats, a structured multi-agent retrieval-augmented generation (RAG) system submitted to the MMU-RAGent competition at NeurIPS 2025, where it was awarded Best Dynamic Evaluation in the text-to-text track. Rather than targeting benchma

NICLNO3 models #research #competition #language-models Read on arxiv →

arxivMay 29

The Best of the Two Worlds: Harmonizing Semantic and Hash IDs for Sequential Recommendation

arXiv:2512.10388v2 Announce Type: replace-cross Abstract: Conventional Sequential Recommender Systems (SRS) typically assign unique hash IDs (HID) to construct item embeddings, which mainly capture collaborative signals from historical user-item interactions. However, such embeddings are vulnerable

#recommendation-systems #information-retrieval #artificial-intelligence Read on arxiv →

arxivMay 16

PolitNuggets: Benchmarking Agentic Discovery of Long-Tail Political Facts

arXiv:2605.14002v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) embedded in agentic frameworks have transformed information retrieval from static, long context question answering into open-ended exploration. Yet real world use requires models to discover and synthesize "long-tail" fact

#benchmark #information-retrieval #multilingual Read on arxiv →

arxivMay 16

A Deterministic Agentic Workflow for HS Tariff Classification: Multi-Dimensional Rule Reasoning with Interpretable Decisions

arXiv:2605.14857v1 Announce Type: new Abstract: Harmonized System (HS) tariff classification is a high-stakes, expert-level task in which a free-form product description must be mapped to a specific six- or eight-digit code under the General Interpretive Rules (GIR), section notes, chapter notes, an

QWQW2 models #tariff-classification #language-models #expert-systems Read on arxiv →

arxivMay 8bullish

Addressing Labelled Data Scarcity: Taxonomy-Agnostic Annotation of PII Values in HTTP Traffic using LLMs

arXiv:2605.06305v1 Announce Type: new Abstract: Automated privacy audits of web and mobile applications often analyse outbound HTTP traffic to detect Personally Identifiable Information (PII) leakage. However, existing learning-based detectors typically depend on scarce, manually labelled traffic an

LA1 model #privacy #security #annotation Read on arxiv →

arxivApr 20bullish

CHOP: Chunkwise Context-Preserving Framework for RAG on Multi Documents

arXiv:2604.15802v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) systems lose retrieval accuracy when similar documents coexist in the vector database, causing unnecessary information, hallucinations, and factual errors. To alleviate this issue, we propose CHOP, a framework that

LARA2 models #retrieval #language-models #information-retrieval Read on arxiv →

arxivApr 16bullish

Public Profile Matters: A Scalable Integrated Approach to Recommend Citations in the Wild

arXiv:2603.17361v2 Announce Type: replace-cross Abstract: Proper citation of relevant literature is essential for contextualising and validating scientific contributions. While current citation recommendation systems leverage local and global textual information, they often overlook the nuances of t

PRDA2 models #information-retrieval #citation-recommendation #artificial-intelligence Read on arxiv →

arxivApr 8bullish

Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA

arXiv:2604.05387v1 Announce Type: cross Abstract: Large language models (LLMs) have been incorporated into numerous industrial applications. Meanwhile, a vast array of API assets is scattered across various functions in the financial domain. An online financial question-answering system can leverage

LA1 model #financial-qa #language-models #data-augmentation Read on arxiv →

arxivApr 4

From BM25 to Corrective RAG: Benchmarking Retrieval Strategies for Text-and-Table Documents

arXiv:2604.01733v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems critically depend on retrieval quality, yet no systematic comparison of modern retrieval methods exists for heterogeneous documents containing both text and tabular data. We benchmark ten retrieval strateg

BMHYHY3 models #information-retrieval #benchmark #financial-qa Read on arxiv →