Search Results for author: Zhimin Hou

Found 3 papers, 1 papers with code

How does the structure embedded in learning policy affect learning quadruped locomotion?

no code implementations • 29 Aug 2020 • Kuangen Zhang, Jongwoo Lee, Zhimin Hou, Clarence W. de Silva, Chenglong Fu, Neville Hogan

This paper focuses on the latter because the structured policy is more intuitive and can inherit insights from previous model-based controllers.

Reinforcement Learning (RL)

Paper
Add Code

Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)

no code implementations • 7 Feb 2020 • Zhimin Hou, Kuangen Zhang, Yi Wan, Dongyu Li, Chenglong Fu, Haoyong Yu

A common way to solve this problem, known as Mixture-of-Experts, is to represent the policy as the weighted sum of multiple components, where different components perform well on different parts of the state space.

Continuous Control

Paper
Add Code

Teach Biped Robots to Walk via Gait Principles and Reinforcement Learning with Adversarial Critics

1 code implementation • 22 Oct 2019 • Kuangen Zhang, Zhimin Hou, Clarence W. de Silva, Haoyong Yu, Chenglong Fu

However, the local minima caused by unsuitable rewards and the overestimation of the cumulative reward impede the maximization of the cumulative reward.

Reinforcement Learning (RL)

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.