Search Results for author: Ashutosh Trivedi

Found 30 papers, 2 papers with code

Integrating Explanations in Learning LTL Specifications from Demonstrations

no code implementations3 Apr 2024 Ashutosh Gupta, John Komp, Abhay Singh Rajput, Krishna Shankaranarayanan, Ashutosh Trivedi, Namrita Varshney

This paper investigates whether recent advances in Large Language Models (LLMs) can assist in translating human explanations into a format that can robustly support learning Linear Temporal Logic (LTL) from demonstrations.

Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis

1 code implementation22 Jan 2024 Richard Roberson, Gowtham Kaki, Ashutosh Trivedi

This study investigates various approaches to using Large Language Models (LLMs) for Text-to-SQL program synthesis, focusing on the outcomes and insights derived.

16k SQL Synthesis +1

Assume-Guarantee Reinforcement Learning

no code implementations15 Dec 2023 Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

We present a modular approach to \emph{reinforcement learning} (RL) in environments consisting of simpler components evolving in parallel.

reinforcement-learning Reinforcement Learning (RL)

Omega-Regular Decision Processes

no code implementations14 Dec 2023 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback).

On the Potential and Limitations of Few-Shot In-Context Learning to Generate Metamorphic Specifications for Tax Preparation Software

no code implementations20 Nov 2023 Dananjay Srinivas, Rohan Das, Saeid Tizpaz-Niari, Ashutosh Trivedi, Maria Leonor Pacheco

Due to the ever-increasing complexity of income tax laws in the United States, the number of US taxpayers filing their taxes using tax preparation software (henceforth, tax software) continues to increase.

In-Context Learning

A PAC Learning Algorithm for LTL and Omega-regular Objectives in MDPs

no code implementations18 Oct 2023 Mateo Perez, Fabio Somenzi, Ashutosh Trivedi

Linear temporal logic (LTL) and omega-regular objectives -- a superset of LTL -- have seen recent use as a way to express non-Markovian objectives in reinforcement learning.

PAC learning reinforcement-learning

Omega-Regular Reward Machines

no code implementations14 Aug 2023 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success.

Reinforcement Learning (RL)

Closure Certificates

no code implementations27 May 2023 Vishnu Murali, Ashutosh Trivedi, Majid Zamani

A barrier certificate, defined over the states of a dynamical system, is a real-valued function whose zero level set characterizes an inductively verifiable state invariant separating reachable states from unsafe ones.

Logic in Computer Science Systems and Control Systems and Control

Policy Synthesis and Reinforcement Learning for Discounted LTL

no code implementations26 May 2023 Rajeev Alur, Osbert Bastani, Kishor Jothimurugan, Mateo Perez, Fabio Somenzi, Ashutosh Trivedi

The difficulty of manually specifying reward functions has led to an interest in using linear temporal logic (LTL) to express objectives for reinforcement learning (RL).

PAC learning reinforcement-learning +1

Information-Theoretic Testing and Debugging of Fairness Defects in Deep Neural Networks

no code implementations9 Apr 2023 Verya Monjezi, Ashutosh Trivedi, Gang Tan, Saeid Tizpaz-Niari

Guided by the quantitative fairness, we present a causal debugging framework to localize inadequately trained layers and neurons responsible for fairness defects.

Decision Making Fairness

Reinforcement Learning with Depreciating Assets

no code implementations27 Feb 2023 Taylor Dohmen, Ashutosh Trivedi

A basic assumption of traditional reinforcement learning is that the value of a reward does not change once it is received by an agent.

reinforcement-learning Reinforcement Learning (RL)

Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems

no code implementations6 Aug 2022 Abolfazl Lavaei, Mateo Perez, Milad Kazemi, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Majid Zamani

A key contribution is to leverage the convergence results for adversarial RL (minimax Q-learning) on finite stochastic arenas to provide control strategies maximizing the probability of satisfaction over the network of continuous-space systems.

Q-Learning reinforcement-learning +1

Alternating Good-for-MDP Automata

no code implementations6 May 2022 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

The surprising answer is that we have to pay significantly less when we instead expand the good-for-MDP property to alternating automata: like the nondeterministic GFM automata obtained from deterministic Rabin automata, the alternating good-for-MDP automata we produce from deterministic Streett automata are bi-linear in the the size of the deterministic automaton and its index, and can therefore be exponentially more succinct than minimal nondeterministic B\"uchi automata.

Reinforcement Learning (RL) Translation

Fairness-aware Configuration of Machine Learning Libraries

2 code implementations13 Feb 2022 Saeid Tizpaz-Niari, Ashish Kumar, Gang Tan, Ashutosh Trivedi

This paper investigates the parameter space of machine learning (ML) algorithms in aggravating or mitigating fairness bugs.

BIG-bench Machine Learning Fairness

Inferring Probabilistic Reward Machines from Non-Markovian Reward Processes for Reinforcement Learning

no code implementations9 Jul 2021 Taylor Dohmen, Noah Topper, George Atia, Andre Beckus, Ashutosh Trivedi, Alvaro Velasquez

The success of reinforcement learning in typical settings is predicated on Markovian assumptions on the reward signal by which an agent learns optimal policies.

Decision Making reinforcement-learning +1

Model-free Reinforcement Learning for Branching Markov Decision Processes

no code implementations12 Jun 2021 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs).

reinforcement-learning Reinforcement Learning (RL)

Controller Synthesis for Omega-Regular and Steady-State Specifications

no code implementations5 Jun 2021 Alvaro Velasquez, Ismail Alkhouri, Andre Beckus, Ashutosh Trivedi, George Atia

Given a Markov decision process (MDP) and a linear-time ($\omega$-regular or LTL) specification, the controller synthesis problem aims to compute the optimal policy that satisfies the specification.

Verification of Hyperproperties for Uncertain Dynamical Systems via Barrier Certificates

no code implementations12 May 2021 Mahathi Anand, Vishnu Murali, Ashutosh Trivedi, Majid Zamani

These verification conditions are then discharged by synthesizing so-called augmented barrier certificates, which provide certain safety guarantees for the underlying system.

Detecting and Understanding Real-World Differential Performance Bugs in Machine Learning Libraries

no code implementations3 Jun 2020 Saeid Tizpaz-Niari, Pavol Cerný, Ashutosh Trivedi

On a set of micro-benchmarks, we show that our approach outperforms state-of-the-art fuzzers in finding inputs to characterize the differential performance.

BIG-bench Machine Learning Clustering

Formal Controller Synthesis for Continuous-Space MDPs via Model-Free Reinforcement Learning

no code implementations2 Mar 2020 Abolfazl Lavaei, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Majid Zamani

A key contribution of the paper is to leverage the classical convergence results for reinforcement learning on finite MDPs and provide control strategies maximizing the probability of satisfaction over unknown, continuous-space MDPs while providing probabilistic closeness guarantees.

reinforcement-learning Reinforcement Learning (RL)

Efficient Detection and Quantification of Timing Leaks with Neural Networks

no code implementations23 Jul 2019 Saeid Tizpaz-Niari, Pavol Cerny, Sriram Sankaranarayanan, Ashutosh Trivedi

As demonstrated in our experiments, both of these tasks are feasible in practice --- making the approach a significant improvement over the state-of-the-art side channel detectors and quantifiers.

Quantitative Mitigation of Timing Side Channels

no code implementations21 Jun 2019 Saeid Tizpaz-Niari, Pavol Cerny, Ashutosh Trivedi

In contrast to the existing mitigation approaches, we show that in the functional-observation threat model, SCHMIT is scalable and able to maximize confidentiality under the performance overhead bound.

Data-Driven Debugging for Functional Side Channels

no code implementations30 Aug 2018 Saeid Tizpaz-Niari, Pavol Cerny, Ashutosh Trivedi

On the realistic programs, we show the scalability of FUCHSIA in analyzing functional side channels in Java programs with thousands of methods.

Clustering

Differential Performance Debugging with Discriminant Regression Trees

no code implementations11 Nov 2017 Saeid Tizpaz-Niari, Pavol Cerny, Bor-Yuh Evan Chang, Ashutosh Trivedi

We propose a data-driven technique based on discriminant regression tree (DRT) learning problem where the goal is to discriminate among different classes of inputs.

Clustering regression

Discriminating Traces with Time

no code implementations23 Feb 2017 Saeid Tizpaz-Niari, Pavol Cerny, Bor-Yuh Evan Chang, Sriram Sankaranarayanan, Ashutosh Trivedi

What properties about the internals of a program explain the possible differences in its overall running time for different inputs?

Proceedings of the The First Workshop on Verification and Validation of Cyber-Physical Systems

no code implementations13 Dec 2016 Mehdi Kargahi, Ashutosh Trivedi

The first International Workshop on Verification and Validation of Cyber-Physical Systems (V2CPS-16) was held in conjunction with the 12th International Conference on integration of Formal Methods (iFM 2016) in Reykjavik, Iceland.

Cannot find the paper you are looking for? You can Submit a new open access paper.