StructRL: Recovering Dynamic Programming Structure from Learning Dynamics in Distributional Reinforcement Learning - Databubble