Global convergence of policy gradient methods to (almost) locally optimal policies

Kaiqing Zhang, Alec Koppel, Hao Zhu, Tamer Başar

Research output: Contribution to journalArticlepeer-review

Fingerprint

Dive into the research topics of 'Global convergence of policy gradient methods to (almost) locally optimal policies'. Together they form a unique fingerprint.

Mathematics

Engineering & Materials Science