Tag

#post-training

3 articles tagged #post-training

arxivJun 1

Representation Collapse in Sequential Post-Training of Large Language Models

arXiv:2605.30524v1 Announce Type: new Abstract: Large language models are now adapted through chains of post-training stages rather than through a single instruction-tuning pass. This paper studies whether such sequential post-training gradually compresses internal representations into low-rank, ani

#machine-learning #post-training #representation Read on arxiv →

arxivApr 18

Large Language Model Post-Training: A Unified View of Off-Policy and On-Policy Learning

arXiv:2604.07941v2 Announce Type: replace-cross Abstract: Post-training has become central to turning pretrained large language models (LLMs) into aligned, capable, and deployable systems. Recent progress spans supervised fine-tuning (SFT), preference optimization, reinforcement learning (RL), proce

#post-training #language-models #survey Read on arxiv →

arxivApr 8bullish

StateX: Enhancing RNN Recall via Post-training State Expansion

arXiv:2509.22630v2 Announce Type: replace-cross Abstract: Recurrent neural networks (RNNs), such as linear attention and state-space models, have gained popularity due to their constant per-token complexity when processing long contexts. However, these recurrent models struggle with tasks that requi

#rnn #state-space #post-training Read on arxiv →

Tag

#post-training

3 articles tagged #post-training

arxivJun 1