Internal Knowledge Without External Expression: Probing the Generalization Boundary of a Classical Chinese Language Model

Source

arxiv.orgfull article ↗

Read on arxiv

Publisher summary· verbatim

arXiv:2604.14180v1 Announce Type: new Abstract: We train a 318M-parameter Transformer language model from scratch on a curated corpus of 1.56 billion tokens of pure Classical Chinese, with zero English characters or Arabic numerals. Through systematic out-of-distribution (OOD) testing, we investigat

Discussion

No replies yet. Be first.

Internal Knowledge Without External Expression: Probing the Generalization Boundary of a Classical Chinese Language Model

Related coverage