Search Results for author: Roland Hafner

Found 21 papers, 1 papers with code

Offline Actor-Critic Reinforcement Learning Scales to Large Models

no code implementations • 8 Feb 2024 • Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin Riedmiller

We show that offline actor-critic reinforcement learning can scale to large models - such as transformers - and follows similar scaling laws as supervised learning.

Continuous Control Offline RL +1

Paper
Add Code

Less is more -- the Dispatcher/ Executor principle for multi-task Reinforcement Learning

no code implementations • 14 Dec 2023 • Martin Riedmiller, Tim Hertweck, Roland Hafner

While we agree on the power of scaling - in the sense of Sutton's 'bitter lesson' - we will give some evidence, that considering structure and adding design principles can be a valuable and critical component in particular when data is not abundant and infinite, but is a precious resource.

Decision Making

Paper
Add Code

Barkour: Benchmarking Animal-level Agility with Quadruped Robots

no code implementations • 24 May 2023 • Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, Linda Luu, Ofir Nachum, Ken Oslund, Jason Powell, Diego Reyes, Francesco Romano, Feresteh Sadeghi, Ron Sloat, Baruch Tabanpour, Daniel Zheng, Michael Neunert, Raia Hadsell, Nicolas Heess, Francesco Nori, Jeff Seto, Carolina Parada, Vikas Sindhwani, Vincent Vanhoucke, Jie Tan

In the second approach, we distill the specialist skills into a Transformer-based generalist locomotion policy, named Locomotion-Transformer, that can handle various terrains and adjust the robot's gait based on the perceived environment and robot states.

Benchmarking Navigate

Paper
Add Code

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

no code implementations • 26 Apr 2023 • Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Jan Humplik, Markus Wulfmeier, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess

We investigate whether Deep Reinforcement Learning (Deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies in dynamic environments.

reinforcement-learning

Paper
Add Code

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

no code implementations • 24 Nov 2022 • Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller

The ability to effectively reuse prior knowledge is a key requirement when building general and flexible Reinforcement Learning (RL) agents.

Reinforcement Learning (RL)

Paper
Add Code

Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors

no code implementations • 31 Mar 2022 • Steven Bohez, Saran Tunyasuvunakool, Philemon Brakel, Fereshteh Sadeghi, Leonard Hasenclever, Yuval Tassa, Emilio Parisotto, Jan Humplik, Tuomas Haarnoja, Roland Hafner, Markus Wulfmeier, Michael Neunert, Ben Moran, Noah Siegel, Andrea Huber, Francesco Romano, Nathan Batchelor, Federico Casarini, Josh Merel, Raia Hadsell, Nicolas Heess

We investigate the use of prior knowledge of human and animal movement to learn reusable locomotion skills for real legged robots.

Paper
Add Code

Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration

no code implementations • 17 Sep 2021 • Oliver Groth, Markus Wulfmeier, Giulia Vezzani, Vibhavari Dasagi, Tim Hertweck, Roland Hafner, Nicolas Heess, Martin Riedmiller

Curiosity-based reward schemes can present powerful exploration mechanisms which facilitate the discovery of solutions for complex, sparse or long-horizon tasks.

Paper
Add Code

Collect & Infer -- a fresh look at data-efficient Reinforcement Learning

no code implementations • 23 Aug 2021 • Martin Riedmiller, Jost Tobias Springenberg, Roland Hafner, Nicolas Heess

This position paper proposes a fresh look at Reinforcement Learning (RL) from the perspective of data-efficiency.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Representation Matters: Improving Perception and Exploration for Robotics

no code implementations • 3 Nov 2020 • Markus Wulfmeier, Arunkumar Byravan, Tim Hertweck, Irina Higgins, Ankush Gupta, tejas kulkarni, Malcolm Reynolds, Denis Teplyashin, Roland Hafner, Thomas Lampe, Martin Riedmiller

Furthermore, the value of each representation is evaluated in terms of three properties: dimensionality, observability and disentanglement.

Disentanglement

Paper
Add Code

Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion

no code implementations • 6 Aug 2020 • Roland Hafner, Tim Hertweck, Philipp Klöppner, Michael Bloesch, Michael Neunert, Markus Wulfmeier, Saran Tunyasuvunakool, Nicolas Heess, Martin Riedmiller

Modern Reinforcement Learning (RL) algorithms promise to solve difficult motor control problems directly from raw sensory inputs.

Reinforcement Learning (RL)

Paper
Add Code

Data-efficient Hindsight Off-policy Option Learning

no code implementations • 30 Jul 2020 • Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Siegel, Nicolas Heess, Martin Riedmiller

We introduce Hindsight Off-policy Options (HO2), a data-efficient option learning algorithm.

Robot Manipulation

Paper
Add Code

Simple Sensor Intentions for Exploration

no code implementations • 15 May 2020 • Tim Hertweck, Martin Riedmiller, Michael Bloesch, Jost Tobias Springenberg, Noah Siegel, Markus Wulfmeier, Roland Hafner, Nicolas Heess

In particular, we show that a real robotic arm can learn to grasp and lift and solve a Ball-in-a-Cup task from scratch, when only raw sensor streams are used for both controller input and in the auxiliary reward definition.

Paper
Add Code

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning

no code implementations • ICLR 2020 • Noah Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin Riedmiller

In practice, however, standard off-policy algorithms fail in the batch setting for continuous control.

Continuous Control Multi-Task Learning +2

Paper
Add Code

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

no code implementations • 19 Feb 2020 • Noah Y. Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin Riedmiller

In practice, however, standard off-policy algorithms fail in the batch setting for continuous control.

Continuous Control Multi-Task Learning +2

Paper
Add Code

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics

no code implementations • 2 Jan 2020 • Michael Neunert, Abbas Abdolmaleki, Markus Wulfmeier, Thomas Lampe, Jost Tobias Springenberg, Roland Hafner, Francesco Romano, Jonas Buchli, Nicolas Heess, Martin Riedmiller

In contrast, we propose to treat hybrid problems in their 'native' form by solving them with hybrid reinforcement learning, which optimizes for discrete and continuous actions simultaneously.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

no code implementations • 9 Oct 2019 • Arunkumar Byravan, Jost Tobias Springenberg, Abbas Abdolmaleki, Roland Hafner, Michael Neunert, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

Humans are masters at quickly learning many complex tasks, relying on an approximate understanding of the dynamics of their environments.

Model-based Reinforcement Learning Reinforcement Learning (RL) +2

Paper
Add Code

Compositional Transfer in Hierarchical Reinforcement Learning

no code implementations • 26 Jun 2019 • Markus Wulfmeier, Abbas Abdolmaleki, Roland Hafner, Jost Tobias Springenberg, Michael Neunert, Tim Hertweck, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

The successful application of general reinforcement learning algorithms to real-world robotics applications is often limited by their high data requirements.

General Reinforcement Learning Hierarchical Reinforcement Learning +4

Paper
Add Code

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

no code implementations • 13 Feb 2019 • Devin Schwab, Tobias Springenberg, Murilo F. Martins, Thomas Lampe, Michael Neunert, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller

We present a method for fast training of vision based control policies on real robots.

Imitation Learning

Paper
Add Code

Learning by Playing - Solving Sparse Reward Tasks from Scratch

1 code implementation • ICML 2018 • Martin Riedmiller, Roland Hafner, Thomas Lampe, Michael Neunert, Jonas Degrave, Tom Van de Wiele, Volodymyr Mnih, Nicolas Heess, Jost Tobias Springenberg

We propose Scheduled Auxiliary Control (SAC-X), a new learning paradigm in the context of Reinforcement Learning (RL).

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

no code implementations • 27 May 2017 • Rico Jonschkowski, Roland Hafner, Jonathan Scholz, Martin Riedmiller

We propose position-velocity encoders (PVEs) which learn---without supervision---to encode images to positions and velocities of task-relevant objects.

Image Reconstruction Position

Paper
Add Code

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

no code implementations • ICLR 2018 • Ivaylo Popov, Nicolas Heess, Timothy Lillicrap, Roland Hafner, Gabriel Barth-Maron, Matej Vecerik, Thomas Lampe, Yuval Tassa, Tom Erez, Martin Riedmiller

Solving this difficult and practically relevant problem in the real world is an important long-term goal for the field of robotics.

Continuous Control Q-Learning +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.