Search Results for author: Matt Peng

Found 2 papers, 0 papers with code

An Adaptive State Aggregation Algorithm for Markov Decision Processes

no code implementations • 23 Jul 2021 • Guanting Chen, Johann Demetrio Gaebler, Matt Peng, Chunlin Sun, Yinyu Ye

Value iteration is a well-known method for solving Markov Decision Processes (MDPs) that is simple to implement and has strong theoretical convergence guarantees.

Linear Representation Meta-Reinforcement Learning for Instant Adaptation

no code implementations • 12 Jan 2021 • Matt Peng, Banghua Zhu, Jiantao Jiao

This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that extrapolates well to out-of-distribution tasks without reusing training data and adapts almost instantaneously, needing only a few samples at test time.

Continuous Control • Meta Reinforcement Learning • +2
