no code implementations • 15 Nov 2020 • David Radu, Mathias Berger, Antoine Dubois, Raphael Fonteneau, Hrvoje Pandzic, Yury Dvorkin, Quentin Louveaux, Damien Ernst
In addition, two variants of these siting schemes are provided, wherein the number of sites to be selected is specified on a country-by-country basis rather than Europe-wide.
no code implementations • 22 Sep 2017 • Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau
This paper provides an analysis of the tradeoff between asymptotic bias (suboptimality with unlimited data) and overfitting (additional suboptimality due to limited data) in the context of reinforcement learning with partial observability.
no code implementations • 7 Dec 2015 • Vincent François-Lavet, Raphael Fonteneau, Damien Ernst
When the discount factor progressively increases up to its final value, we empirically show that it is possible to significantly reduce the number of learning steps.
no code implementations • 14 Sep 2015 • Michael Castronovo, Damien Ernst, Adrien Couetoux, Raphael Fonteneau
In order to enable the comparison of non-anytime algorithms, our methodology also includes a detailed analysis of the computation time requirement of each algorithm.
no code implementations • 18 Mar 2014 • Raphael Fonteneau, L. A. Prashanth
We propose novel policy search algorithms in the context of off-policy, batch mode reinforcement learning (RL) with continuous state and action spaces.
no code implementations • NeurIPS 2013 • Gunnar Kedenburg, Raphael Fonteneau, Remi Munos
This paper addresses the problem of online planning in Markov Decision Processes using only a generative model.