Search Results for author: Vida Fathi

Found 1 paper, 0 papers with code

Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods

no code implementations • 29 Nov 2020 • Vida Fathi, Jalal Arabneydi, Amir G. Aghdam

In such systems, agents are partitioned into a few sub-populations wherein the agents in each sub-population are coupled in the dynamics and cost function through a set of linear regressions of the states and actions of all agents.

Policy Gradient Methods
