no code implementations • 3 May 2024 • Sang Bin Moon, Abolfazl Hashemi
The Adversarial Markov Decision Process (AMDP) is a learning framework for handling unknown and varying tasks in decision-making applications such as robotics and recommendation systems.