Search Results for author: Wouter Jongeneel

Found 3 papers, 0 papers with code

A Large Deviations Perspective on Policy Gradient Algorithms

no code implementations13 Nov 2023 Wouter Jongeneel, Mengmeng Li, Daniel Kuhn

Motivated by policy gradient methods in the context of reinforcement learning, we derive the first large deviation rate function for the iterates generated by stochastic gradient descent for possibly non-convex objectives satisfying a Polyak-Lojasiewicz condition.

Policy Gradient Methods reinforcement-learning

Small errors in random zeroth-order optimization are imaginary

no code implementations9 Mar 2021 Wouter Jongeneel, Man-Chung Yue, Daniel Kuhn

Most zeroth-order optimization algorithms mimic a first-order algorithm but replace the gradient of the objective function with some gradient estimator that can be computed from a small number of function evaluations.

Optimization and Control 65D25, 65G50, 65K05, 65Y04, 65Y20, 90C56

Topological Linear System Identification via Moderate Deviations Theory

no code implementations5 Mar 2021 Wouter Jongeneel, Tobias Sutter, Daniel Kuhn

Two dynamical systems are topologically equivalent when their phase-portraits can be morphed into each other by a homeomorphic coordinate transformation on the state space.

Optimization and Control

Cannot find the paper you are looking for? You can Submit a new open access paper.