Search Results for author: Roderick Bloem

Found 11 papers, 3 papers with code

Safety Shielding under Delayed Observation

1 code implementation • 5 Jul 2023 • Filip Cano Córdoba, Alexander Palmisano, Martin Fränzle, Roderick Bloem, Bettina Könighofer

We propose synthesis algorithms to compute \emph{delay-resilient shields} that guarantee safety under worst-case assumptions on the delays of the input signals.

Autonomous Driving

Paper
Code

Automata Learning meets Shielding

1 code implementation • 4 Dec 2022 • Martin Tappler, Stefan Pranger, Bettina Könighofer, Edi Muškardin, Roderick Bloem, Kim Larsen

Iteratively, we use the collected data to learn new MDPs with higher accuracy, resulting in turn in shields able to prevent more safety violations.

Q-Learning Reinforcement Learning (RL)

Paper
Code

Online Shielding for Reinforcement Learning

no code implementations • 4 Dec 2022 • Bettina Könighofer, Julian Rudolf, Alexander Palmisano, Martin Tappler, Roderick Bloem

The intuition behind online shielding is to compute at runtime the set of all states that could be reached in the near future.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Correct-by-Construction Runtime Enforcement in AI -- A Survey

no code implementations • 30 Aug 2022 • Bettina Könighofer, Roderick Bloem, Rüdiger Ehlers, Christian Pek

In this paper, we are interested in techniques for constructing runtime enforcers for the concrete application domain of enforcing safety in AI.

Self-Learning

Paper
Add Code

Online Shielding for Stochastic Systems

no code implementations • 17 Dec 2020 • Bettina Könighofer, Julian Rudolf, Alexander Palmisano, Martin Tappler, Roderick Bloem

The intuition behind online shielding is to compute during run-time the set of all states that could be reached in the near future.

Logic in Computer Science

Paper
Add Code

Safety Synthesis Sans Specification

no code implementations • 15 Nov 2020 • Roderick Bloem, Hana Chockler, Masoud Ebrahimi, Dana Fisman, Heinz Riener

We define the problem of learning a transducer ${S}$ from a target language $U$ containing possibly conflicting transducers, using membership queries and conjecture queries.

Paper
Add Code

It's Time to Play Safe: Shield Synthesis for Timed Systems

no code implementations • 30 Jun 2020 • Roderick Bloem, Peter Gjøl Jensen, Bettina Könighofer, Kim Guldstrand Larsen, Florian Lorber, Alexander Palmisano

A timed pre-shield is placed before the system and provides a set of safe outputs.

Paper
Add Code

Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning (Full Version)

no code implementations • 10 Jul 2019 • Bernhard K. Aichernig, Roderick Bloem, Masoud Ebrahimi, Martin Horn, Franz Pernkopf, Wolfgang Roth, Astrid Rupp, Martin Tappler, Markus Tranninger

Therefore, there is considerable interest in learning such hybrid behavior by means of machine learning which requires sufficient and representative training data covering the behavior of the physical system adequately.

BIG-bench Machine Learning

Paper
Add Code

Safe Reinforcement Learning via Probabilistic Shields

no code implementations • 16 Jul 2018 • Nils Jansen, Bettina Könighofer, Sebastian Junges, Alexandru C. Serban, Roderick Bloem

This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty.

Decision Making reinforcement-learning +3

Paper
Add Code

Safe Reinforcement Learning via Shielding

1 code implementation • 29 Aug 2017 • Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu

In the first one, the shield acts each time the learning agent is about to make a decision and provides a list of safe actions.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

QBF Solving by Counterexample-guided Expansion

no code implementations • 4 Nov 2016 • Roderick Bloem, Nicolas Braud-Santoni, Vedad Hadzic

We introduce a novel generalization of Counterexample-Guided Inductive Synthesis (CEGIS) and instantiate it to yield a novel, competitive algorithm for solving Quantified Boolean Formulas (QBF).

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.