Search Results for author: Ajin Joseph

Found 2 papers, 1 papers with code

Two-Timescale Networks for Nonlinear Value Function Approximation

no code implementations • ICLR 2019 • Wesley Chung, Somjit Nath, Ajin Joseph, Martha White

A key component for many reinforcement learning agents is to learn a value function, either for policy evaluation or control.

Q-Learning Vocal Bursts Valence Prediction

Paper
Add Code

Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement

1 code implementation • 22 Oct 2018 • Samuel Neumann, Sungsu Lim, Ajin Joseph, Yangchen Pan, Adam White, Martha White

We first provide a policy improvement result in an idealized setting, and then prove that our conditional CEM (CCEM) strategy tracks a CEM update per state, even with changing action-values.

Policy Gradient Methods Q-Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.