Adaptive KL-UCB Based Bandit Algorithms for Markovian and I.i.d. Settings

IEEE TRANSACTIONS ON AUTOMATIC CONTROL（2024）

Cited 0|Views38

Key words

Bandit Optimization,Regret Analysis,Contextual Bandits,Adversarial Multi-Armed Bandits,Reinforcement Learning

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Chat Paper

Summary is being generated by the instructions you defined