Batched Single-Index Global Multi-Armed Bandits with Covariates

Source

arxiv.org


Publisher summary (verbatim)

arXiv:2503.00565v3. Abstract: The multi-armed bandits (MAB) framework is a widely used approach for sequential decision-making, where a decision-maker selects an arm in each round with the goal of maximizing long-term rewards. In many practical applications, such as perso


