Learning in POMDPs is Sample-Efficient with Hindsight Observability

Jonathan N. Lee, Alekh Agarwal, Christoph Dann, Tong Zhang

Research output: Contribution to journal, Conference article, peer-review