Search Results for author: Xianyi Wu

Found 2 papers, 0 papers with code

Minimax Weight Learning for Absorbing MDPs

no code implementations • 9 Jan 2023 • Fengyin Li, Yuqiang Li, Xianyi Wu

Reinforcement learning policy evaluation problems are often modeled as finite or discounted/averaged infinite-horizon MDPs.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A General Framework of Multi-Armed Bandit Processes by Arm Switch Restrictions

no code implementations • 20 Aug 2018 • Wenqing Bao, Xiaoqiang Cai, Xianyi Wu

This paper proposes a general framework of multi-armed bandit (MAB) processes by introducing a type of restrictions on the switches among arms evolving in continuous time.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.