One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

Pedro Cisneros-Velarde, Boxiang Lyu, Sanmi Koyejo, Mladen Kolar

Research output: Contribution to journalConference articlepeer-review

Fingerprint

Dive into the research topics of 'One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning'. Together they form a unique fingerprint.

Keyphrases

Computer Science