Almost optimal model-free reinforcement learning via reference-advantage decomposition

Zihan Zhang, Yuan Zhou, Xiangyang Ji

Research output: Contribution to journalConference articlepeer-review

Fingerprint

Dive into the research topics of 'Almost optimal model-free reinforcement learning via reference-advantage decomposition'. Together they form a unique fingerprint.

Engineering & Materials Science