no code implementations • 15 Feb 2024 • Yinglun Xu, Rohan Gumaste, Gagandeep Singh
To the best of our knowledge, we propose the first black-box reward poisoning attack in the general offline RL setting.
Offline RL reinforcement-learning