Truncated Rectified Flow Policy for Reinforcement Learning with One-Step Sampling - Databubble