Search Results for author: Roderick Bloem

Found 11 papers, 3 papers with code

Safety Shielding under Delayed Observation

1 code implementation5 Jul 2023 Filip Cano Córdoba, Alexander Palmisano, Martin Fränzle, Roderick Bloem, Bettina Könighofer

We propose synthesis algorithms to compute \emph{delay-resilient shields} that guarantee safety under worst-case assumptions on the delays of the input signals.

Autonomous Driving

Automata Learning meets Shielding

1 code implementation4 Dec 2022 Martin Tappler, Stefan Pranger, Bettina Könighofer, Edi Muškardin, Roderick Bloem, Kim Larsen

Iteratively, we use the collected data to learn new MDPs with higher accuracy, resulting in turn in shields able to prevent more safety violations.

Q-Learning Reinforcement Learning (RL)

Online Shielding for Reinforcement Learning

no code implementations4 Dec 2022 Bettina Könighofer, Julian Rudolf, Alexander Palmisano, Martin Tappler, Roderick Bloem

The intuition behind online shielding is to compute at runtime the set of all states that could be reached in the near future.

reinforcement-learning Reinforcement Learning (RL)

Correct-by-Construction Runtime Enforcement in AI -- A Survey

no code implementations30 Aug 2022 Bettina Könighofer, Roderick Bloem, Rüdiger Ehlers, Christian Pek

In this paper, we are interested in techniques for constructing runtime enforcers for the concrete application domain of enforcing safety in AI.

Self-Learning

Online Shielding for Stochastic Systems

no code implementations17 Dec 2020 Bettina Könighofer, Julian Rudolf, Alexander Palmisano, Martin Tappler, Roderick Bloem

The intuition behind online shielding is to compute during run-time the set of all states that could be reached in the near future.

Logic in Computer Science

Safety Synthesis Sans Specification

no code implementations15 Nov 2020 Roderick Bloem, Hana Chockler, Masoud Ebrahimi, Dana Fisman, Heinz Riener

We define the problem of learning a transducer ${S}$ from a target language $U$ containing possibly conflicting transducers, using membership queries and conjecture queries.

Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning (Full Version)

no code implementations10 Jul 2019 Bernhard K. Aichernig, Roderick Bloem, Masoud Ebrahimi, Martin Horn, Franz Pernkopf, Wolfgang Roth, Astrid Rupp, Martin Tappler, Markus Tranninger

Therefore, there is considerable interest in learning such hybrid behavior by means of machine learning which requires sufficient and representative training data covering the behavior of the physical system adequately.

BIG-bench Machine Learning

Safe Reinforcement Learning via Probabilistic Shields

no code implementations16 Jul 2018 Nils Jansen, Bettina Könighofer, Sebastian Junges, Alexandru C. Serban, Roderick Bloem

This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty.

Decision Making reinforcement-learning +3

Safe Reinforcement Learning via Shielding

1 code implementation29 Aug 2017 Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu

In the first one, the shield acts each time the learning agent is about to make a decision and provides a list of safe actions.

reinforcement-learning Reinforcement Learning (RL) +1

QBF Solving by Counterexample-guided Expansion

no code implementations4 Nov 2016 Roderick Bloem, Nicolas Braud-Santoni, Vedad Hadzic

We introduce a novel generalization of Counterexample-Guided Inductive Synthesis (CEGIS) and instantiate it to yield a novel, competitive algorithm for solving Quantified Boolean Formulas (QBF).

Cannot find the paper you are looking for? You can Submit a new open access paper.