The Last Fingerprint: How Markdown Training Shapes LLM Prose

Source

arxiv.orgfull article ↗

Read on arxiv

Publisher summary· verbatim

arXiv:2603.27006v1 Announce Type: cross Abstract: Large language models produce em dashes at varying rates, and the observation that some models "overuse" them has become one of the most widely discussed markers of AI-generated text. Yet no mechanistic account of this pattern exists, and the paralle

Models mentioned