LEARNING LONG-TERM REWARD REDISTRIBUTION VIA RANDOMIZED RETURN DECOMPOSITION

Zhizhou Ren, Ruihan Guo, Yuan Zhou, Jian Peng

Research output: Contribution to conferencePaperpeer-review

Fingerprint

Dive into the research topics of 'LEARNING LONG-TERM REWARD REDISTRIBUTION VIA RANDOMIZED RETURN DECOMPOSITION'. Together they form a unique fingerprint.

Keyphrases

Mathematics

Engineering

Computer Science

Chemical Engineering