no code implementations • 9 Mar 2024 • Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Biyik
Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task.
no code implementations • 23 Aug 2022 • Gaurav R. Ghosal, Matthew Zurek, Daniel S. Brown, Anca D. Dragan
In this work, we advocate that grounding the rationality coefficient in real data for each feedback type, rather than assuming a default value, has a significant positive effect on reward learning.
no code implementations • 17 Jun 2021 • Gaurav R. Ghosal, Reza Abbasi-Asl
We validate our framework on a simulated dataset with embedded patterns, as well as a real human activity recognition problem.