no code implementations • 15 Aug 2013 • Yury Sokolov, Robert Kozma, Ludmilla D. Werbos, Paul J. Werbos
This paper provides new stability results for Action-Dependent Heuristic Dynamic Programming (ADHDP), using a control algorithm that iteratively improves an internal model of the external world in the autonomous system based on its continuous interaction with the environment.