arxiv
Published April 1, 2026 at 4:00 AM
The Last Fingerprint: How Markdown Training Shapes LLM Prose
Publisher summary (verbatim)
arXiv:2603.27006v1 Announce Type: cross Abstract: Large language models produce em dashes at varying rates, and the observation that some models "overuse" them has become one of the most widely discussed markers of AI-generated text. Yet no mechanistic account of this pattern exists, and the paralle
Originally published on arXiv