Search Results for author: Longxiang He

Found 2 papers, 1 papers with code

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization

no code implementations28 May 2024 Longxiang He, Li Shen, Junbo Tan, Xueqian Wang

IDQL reinterprets IQL as an actor-critic method and gets weights of implicit policy, however, this weight only holds for the optimal value function.

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

1 code implementation9 Oct 2023 Longxiang He, Li Shen, Linrui Zhang, Junbo Tan, Xueqian Wang

Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning, which is generally solved by advantage weighted regression (AWR).

D4RL Offline RL +1

Cannot find the paper you are looking for? You can Submit a new open access paper.