Fingerprint
Dive into the research topics of 'Convergence and Iteration Complexity of Policy Gradient Method for Infinite-horizon Reinforcement Learning'. Together they form a unique fingerprint.- Sort by
- Weight
- Alphabetically
Kaiqing Zhang, Alec Koppel, Hao Zhu, Tamer Basar
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution