17 Apr 2019 • Reazul H. Russel, Tianyi Gu, Marek Petrik
Optimism about poorly understood states and actions is the main driving force of exploration for many provably efficient reinforcement learning algorithms.