Search Results for author: Agustin Castellano

Found 5 papers, 0 papers with code

Learning safety critics via a non-contractive binary bellman operator

no code implementations23 Jan 2024 Agustin Castellano, Hancheng Min, Juan Andrés Bazerque, Enrique Mallada

To that end, we study the properties of the binary safety critic associated with a deterministic dynamical system that seeks to avoid reaching an unsafe region.

Reinforcement Learning (RL)

Reinforcement Learning with Almost Sure Constraints

no code implementations9 Dec 2021 Agustin Castellano, Hancheng Min, Juan Bazerque, Enrique Mallada

We argue that stationary policies are not sufficient for solving this problem, and that a rich class of policies can be found by endowing the controller with a scalar quantity, so called budget, that tracks how close the agent is to violating the constraint.

Navigate reinforcement-learning +1

Learning to Act Safely with Limited Exposure and Almost Sure Certainty

no code implementations18 May 2021 Agustin Castellano, Hancheng Min, Juan Bazerque, Enrique Mallada

Our analysis further highlights a trade-off between the time lag for the underlying MDP necessary to detect unsafe actions, and the level of exposure to unsafe events.

Navigate

Assured RL: Reinforcement Learning with Almost Sure Constraints

no code implementations24 Dec 2020 Agustin Castellano, Juan Bazerque, Enrique Mallada

We consider the problem of finding optimal policies for a Markov Decision Process with almost sure constraints on state transitions and action triplets.

Q-Learning reinforcement-learning +1

Learning to be safe, in finite time

no code implementations1 Oct 2020 Agustin Castellano, Juan Bazerque, Enrique Mallada

More precisely, by defining a handicap metric that counts the number of unsafe actions, we provide an algorithm for discarding unsafe machines (or actions), with probability one, that achieves constant handicap.

Cannot find the paper you are looking for? You can Submit a new open access paper.