no code implementations • 2 Jan 2022 • Arun Raman, Keerthan Shagrithaya, Shalabh Bhatnagar
We assume that the set of action sequences that are deemed unsafe and/or safe are given in terms of a finite-state automaton; and propose a supervisor that disables a subset of actions at every state of the MDP so that the constraints on action sequence are satisfied.