Minimax PAC Bounds for Learning in Exogenous Contextual MDPs

Source

arxiv.org (full article)

Publisher summary (verbatim)

arXiv:2606.25170v1 Announce Type: cross Abstract: We study PAC learning in tabular discounted Markov decision processes with exogenous i.i.d. contexts, with discount factor $\gamma$, finite state space $\mathcal X$, action space $\mathcal A$, and context space $\mathcal Z$. At each time step, a cont
