8 Jan 2024 • Riccardo Poiani, Gabriele Curti, Alberto Maria Metelli, Marcello Restelli
In this work, we extend the inverse reinforcement learning (IRL) formulation to problems where, in addition to demonstrations from the optimal agent, we can observe the behavior of multiple sub-optimal experts.