Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Source

arxiv.orgfull article ↗

Read on arxiv

Publisher summary· verbatim

arXiv:2603.00454v2 Announce Type: replace-cross Abstract: Generative Flow Networks (GFlowNets) enable fine-tuning large language models to approximate reward-proportional posteriors, but they remain prone to mode collapse, manifesting as prefix collapse and length bias. We attribute this to two fact

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

Discussion

No replies yet. Be first.

Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Related coverage

Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Related coverage