Doubly robust off-policy value evaluation for reinforcement learning

Nan Jiang, Lihong Li

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
