no code implementations • 9 May 2024 • Matteo Papini, Giorgio Manganini, Alberto Maria Metelli, Marcello Restelli
We provide an iterative algorithm that alternates between the cross-entropy estimation of the minimum-variance behavioral policy and the actual policy optimization, leveraging on defensive IS.
no code implementations • 9 Jul 2020 • Francesco Grassi, Giorgio Manganini, Michele Garraffa, Laura Mainini
Traditional methods for black box optimization require a considerable number of evaluations which can be time consuming, unpractical, and often unfeasible for many engineering applications that rely on accurate representations and expensive models to evaluate.