LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

arXiv:2605.23901v1 (announce type: cross)

Abstract: Existing scaling laws for Large Language Models (LLMs), predominantly monotonic power laws, fail to explain emerging non-monotonic phenomena such as catastrophic overtraining and quantization-induced degradation, where performance deteriorates despit
