13 Dec 2021 • Pierre Liotet, Francesco Vidaich, Alberto Maria Metelli, Marcello Restelli
This hyper-policy is trained to maximize the estimated future performance, efficiently reusing past data by means of importance sampling, at the cost of introducing a controlled bias.
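As a minimal sketch of the importance-sampling idea invoked here (not the paper's hyper-policy estimator): past data collected under one policy can be reweighted by likelihood ratios to estimate expected performance under another. The Gaussian policies and the reward function below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def gaussian_pdf(x, mean, std=1.0):
    """Density of N(mean, std^2), used for the likelihood ratio."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2 * np.pi))

def reward(a):
    # Hypothetical smooth reward over a 1-D action.
    return np.tanh(a)

# Past data: actions sampled from a behavior policy N(0, 1).
actions = rng.normal(loc=0.0, scale=1.0, size=100_000)

# Importance weights re-target the samples to a new policy N(0.5, 1).
weights = gaussian_pdf(actions, mean=0.5) / gaussian_pdf(actions, mean=0.0)

# Off-policy estimate of the new policy's expected reward, reusing past data.
is_estimate = np.mean(weights * reward(actions))

# On-policy Monte Carlo estimate for comparison (fresh samples).
mc_estimate = np.mean(reward(rng.normal(loc=0.5, scale=1.0, size=100_000)))
```

The estimator is unbiased here, but in practice the likelihood ratios inflate variance, which is why methods of this kind trade a controlled bias (e.g. via weight regularization) for a lower-variance estimate.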