arxivMay 12

Building Korean linguistic resource for NLU data generation of banking app CS dialog system

arXiv:2605.10241v1 Announce Type: new Abstract: Natural language understanding (NLU) is integral to task-oriented dialog systems, but demands a considerable amount of annotated training data to increase the coverage of diverse utterances. In this study, we report the construction of a linguistic res

DIHAKO4 models · +1 #nlu #dialog systems #annotation

arxivMay 8bullish

Addressing Labelled Data Scarcity: Taxonomy-Agnostic Annotation of PII Values in HTTP Traffic using LLMs

arXiv:2605.06305v1 Announce Type: new Abstract: Automated privacy audits of web and mobile applications often analyse outbound HTTP traffic to detect Personally Identifiable Information (PII) leakage. However, existing learning-based detectors typically depend on scarce, manually labelled traffic an

LA1 model #privacy #security #annotation Read on arxiv →

arxivApr 23

Structured Disagreement in Health-Literacy Annotation: Epistemic Stability, Conceptual Difficulty, and Agreement-Stratified Inference

arXiv:2604.19943v1 Announce Type: new Abstract: Annotation pipelines in Natural Language Processing (NLP) commonly assume a single latent ground truth per instance and resolve disagreement through label aggregation. Perspectivist approaches challenge this view by treating disagreement as potentially

#nlp #annotation #health-literacy Read on arxiv →