Dream to Control: Learning Behaviors by Latent Imagination
arXiv (Cornell University), 2019
Abstract
Learned world models summarize an agent's experience to facilitate learning complex behaviors. While learning world models from high-dimensional sensory inputs is becoming feasible through deep learning, there are many potential ways for deriving behaviors from them. We present Dreamer, a reinforcement learning agent that solves long-horizon tasks from images purely by latent imagination. We efficiently learn behaviors by propagating analytic gradients of learned state values back through trajectories imagined in the compact state space of a learned world model. On 20 challenging visual control tasks, Dreamer exceeds existing approaches in data-efficiency, computation time, and final performance.
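To make the idea of propagating analytic value gradients through imagined trajectories concrete, here is a minimal JAX sketch. It is not the authors' implementation. Every name in it (`init_params`, `imagined_return`, the matrices `A`, `B`, `K`, and the constants) is hypothetical, the "learned" world model, value function, and policy are random toy stand-ins, and the objective is a plain discounted reward sum with a value bootstrap at the horizon, which may differ from the return estimator used in the paper.

```python
import jax
import jax.numpy as jnp

# Hypothetical toy sizes; none of these values come from the paper.
LATENT_DIM, ACTION_DIM, HORIZON, DISCOUNT = 4, 2, 15, 0.99


def init_params(key):
    """Random stand-ins for a learned world model, value function, and policy."""
    k1, k2, k3, k4, k5 = jax.random.split(key, 5)
    world = {
        "A": 0.9 * jnp.eye(LATENT_DIM)
        + 0.05 * jax.random.normal(k1, (LATENT_DIM, LATENT_DIM)),
        "B": 0.1 * jax.random.normal(k2, (LATENT_DIM, ACTION_DIM)),
        "w_reward": jax.random.normal(k3, (LATENT_DIM,)),
    }
    value = {"w": 0.1 * jax.random.normal(k4, (LATENT_DIM,))}
    policy = {"K": 0.1 * jax.random.normal(k5, (ACTION_DIM, LATENT_DIM))}
    return world, value, policy


def imagined_return(policy, world, value, start_state):
    """Roll the policy forward in latent space and score the imagined trajectory."""

    def step(state, t):
        action = jnp.tanh(policy["K"] @ state)  # deterministic toy policy
        next_state = jnp.tanh(world["A"] @ state + world["B"] @ action)  # latent transition
        reward = world["w_reward"] @ next_state  # predicted reward
        return next_state, (DISCOUNT ** t) * reward

    final_state, discounted_rewards = jax.lax.scan(
        step, start_state, jnp.arange(HORIZON)
    )
    # The value estimate accounts for rewards beyond the imagination horizon.
    bootstrap = (DISCOUNT ** HORIZON) * (value["w"] @ final_state)
    return discounted_rewards.sum() + bootstrap


# jax.grad differentiates with respect to the first argument (the policy),
# backpropagating through every imagined transition and the final value.
policy_grad = jax.grad(imagined_return)

world, value, policy = init_params(jax.random.PRNGKey(0))
start_state = jnp.ones(LATENT_DIM)
grads = policy_grad(policy, world, value, start_state)

# One gradient-ascent step on the policy parameters (step size is arbitrary).
new_policy = jax.tree_util.tree_map(lambda p, g: p + 0.01 * g, policy, grads)
```

Because the whole rollout is differentiable, the policy gradient comes from a single backward pass and no environment interaction is needed during the behavior update. In this sketch the world model and value function are held fixed; in a full agent they would be trained as well.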