no code implementations • 8 Oct 2015 • John-Alexander M. Assael, Niklas Wahlström, Thomas B. Schön, Marc Peter Deisenroth
We consider a particularly important instance of this challenge, the pixels-to-torques problem, where an RL agent learns a closed-loop control policy ("torques") from pixel information only.
Model-based Reinforcement Learning Model Predictive Control +2
no code implementations • 27 Oct 2014 • John-Alexander M. Assael, Ziyu Wang, Bobak Shahriari, Nando de Freitas
At the core of this approach is a Gaussian process prior that captures our belief about the distribution over functions.