Stochastic primal-dual Q-learning algorithm for discounted MDPs

Donghwan Lee, Niao He

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
