Contextual Slate GLM Bandits with Limited Adaptivity

Source

arxiv.orgfull article ↗

Read on arxiv

Publisher summary· verbatim

arXiv:2606.31449v1 Announce Type: new Abstract: We investigate the contextual slate bandit problem with generalized linear rewards under limited adaptivity. At each round, the learner is presented with $N$ sets of items, where each item is represented by a $d$-dimensional feature vector. The learner

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

Discussion

No replies yet. Be first.

Contextual Slate GLM Bandits with Limited Adaptivity

Related coverage

Contextual Slate GLM Bandits with Limited Adaptivity

Related coverage