Action-dependent control variates for policy optimization via Stein’s identity

Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu

Research output: Contribution to conference / Paper / peer-review
