FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning - Databubble