1 code implementation • 4 Jun 2020 • Josh Roy, George Konidaris
We introduce Wasserstein Adversarial Proximal Policy Optimization (WAPPO), a novel algorithm for visual transfer in Reinforcement Learning that explicitly learns to align the distributions of extracted features between a source and target task.
no code implementations • 25 Sep 2019 • Josh Roy, George Konidaris
In such settings, agents are trained in similar environments, such as simulators, and are then transferred to the original environment.