On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction

Jiawei Huang, Nan Jiang

Research output: Contribution to journal › Conference article › peer-review

Fingerprint

Mathematics