no code implementations • 24 May 2024 • Martin Haugh, Raghav Singal
To address this, we introduce a structural causal model (SCM) consistent with the HMM and show that the expected winnings attributable to cheating (EWAC) can be bounded using linear programs (LPs).
no code implementations • 12 Jan 2024 • Garud Iyengar, Raghav Singal
We model consumer behavior by a conversion funnel that captures the state of each consumer (e. g., interaction history with the firm) and allows the consumer behavior to vary as a function of both her state and firm's sequential interventions.
no code implementations • 27 May 2022 • Martin Haugh, Raghav Singal
We provide an optimization-based framework to perform counterfactual analysis in a dynamic model with hidden states.
no code implementations • 6 Jun 2018 • Jalaj Bhandari, Daniel Russo, Raghav Singal
Temporal difference learning (TD) is a simple iterative algorithm used to estimate the value function corresponding to a given policy in a Markov decision process.