arxiv3d agobullish

LoRA-FA: Efficient and Effective Low Rank Representation Fine-tuning

arXiv:2308.03303v2 Announce Type: replace Abstract: Fine-tuning large language models (LLMs) is crucial for improving their performance on downstream tasks, but full-parameter fine-tuning (Full-FT) is computationally expensive and memory-intensive. Parameter-efficient fine-tuning (PEFT) methods, suc

LOLO2 models #fine-tuning #language-models #optimization

arxiv5d ago

Self-Distillation as a Performance Recovery Mechanism for LLMs: Counteracting Compression and Catastrophic Forgetting

arXiv:2604.15794v1 Announce Type: cross Abstract: Large Language Models (LLMs) have achieved remarkable success, underpinning diverse AI applications. However, they often suffer from performance degradation due to factors such as catastrophic forgetting during Supervised Fine-Tuning (SFT), quantizat

#self-distillation #fine-tuning #large-language-models Read on arxiv →

arxivApr 18

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

arXiv:2604.14258v1 Announce Type: cross Abstract: Large language models are typically post-trained using supervised fine-tuning (SFT) and reinforcement learning (RL), yet effectively unifying efficient knowledge injection with robust generalization remains challenging. In this work, we provide a tra

#language-models #fine-tuning #reinforcement-learning Read on arxiv →

arxivApr 17bullish

Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions

arXiv:2502.16761v2 Announce Type: replace Abstract: Large language models (LLMs) present novel opportunities in public opinion research by predicting survey responses in advance during the early stages of survey design. Prior methods steer LLMs via descriptions of subpopulations as LLMs' input promp

#survey-research #language-models #fine-tuning Read on arxiv →

arxivApr 13

Demystifying Mergeability: Interpretable Properties to Predict Model Merging Success

arXiv:2601.22285v4 Announce Type: replace Abstract: Model merging combines knowledge from separately fine-tuned models, yet success factors remain poorly understood. While recent work treats mergeability as an intrinsic property, we show with an architecture-agnostic framework that it fundamentally

#machine-learning #model-merging #fine-tuning Read on arxiv →

arxivApr 9bullish

Visual prompting reimagined: The power of the Activation Prompts

arXiv:2604.06440v1 Announce Type: cross Abstract: Visual prompting (VP) has emerged as a popular method to repurpose pretrained vision models for adaptation to downstream tasks. Unlike conventional model fine-tuning techniques, VP introduces a universal perturbation directly into the input data to f

#computer-vision #fine-tuning #machine-learning Read on arxiv →

arxivApr 1

The Last Fingerprint: How Markdown Training Shapes LLM Prose

arXiv:2603.27006v1 Announce Type: cross Abstract: Large language models produce em dashes at varying rates, and the observation that some models "overuse" them has become one of the most widely discussed markers of AI-generated text. Yet no mechanistic account of this pattern exists, and the paralle

MEOP2 models #language models #training data #fine-tuning Read on arxiv →