DeepSeekMath Meets Order Book: Group-Aware Policy Optimization for High-Frequency Directional Trading

Source

arxiv.orgfull article ↗

Publisher summary· verbatim

arXiv:2605.25527v1 Announce Type: new Abstract: This paper studies reinforcement learning for high-frequency trading on limit order books by pairing an Order-Flow-based state model with policy-gradient methods. Instead of value-based RL techniques like tabular Q-learning, our approach deploys policy

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

Discussion

No replies yet. Be first.

DeepSeekMath Meets Order Book: Group-Aware Policy Optimization for High-Frequency Directional Trading

Related coverage

DeepSeekMath Meets Order Book: Group-Aware Policy Optimization for High-Frequency Directional Trading

Related coverage