Peer-Predictive Self-Training for Language Model Reasoning - Databubble