5 Feb 2021 • Zhengshang Liu, Yue Yang, Tim Miller, Peta Masters
However, in some situations, we may want to keep a reward function private; that is, to make it difficult for an observer to determine the reward function used.