Search Results for author: Xianyi Wu

Found 2 papers, 0 papers with code

Minimax Weight Learning for Absorbing MDPs

no code implementations • 9 Jan 2023 • Fengyin Li, Yuqiang Li, Xianyi Wu

Reinforcement learning policy evaluation problems are often modeled as finite or discounted/averaged infinite-horizon MDPs.

Reinforcement Learning (RL)

A General Framework of Multi-Armed Bandit Processes by Arm Switch Restrictions

no code implementations • 20 Aug 2018 • Wenqing Bao, Xiaoqiang Cai, Xianyi Wu

This paper proposes a general framework of multi-armed bandit (MAB) processes by introducing a type of restriction on the switches among arms evolving in continuous time.
