From importance sampling to doubly robust policy gradient

Jiawei Huang, Nan Jiang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Fingerprint

Dive into the research topics of 'From importance sampling to doubly robust policy gradient'. Together they form a unique fingerprint.

Keyphrases

Mathematics