David Ha 2/15/2019

Learning Latent Dynamics for Planning from Pixels

This research paper presents the Deep Planning Network (PlaNet), a model-based agent that learns a latent dynamics model directly from pixel observations and chooses actions by planning in the learned latent space. The dynamics model combines deterministic and stochastic transition components and is trained with a multi-step variational inference objective called latent overshooting. Using only pixel observations, PlaNet solves continuous control tasks with contact dynamics, sparse rewards, and partial observability. It needs substantially fewer episodes than model-free methods while reaching final performance close to, and sometimes higher than, theirs.
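To make the idea of combining deterministic and stochastic transitions concrete, here is a minimal toy sketch in plain Python. It is not the paper's implementation: the class name `ToyLatentModel`, the layer sizes, the random untrained weights, and the use of a single tanh layer for the deterministic path (in place of a learned recurrent network) are all assumptions made for illustration. Each step first updates a deterministic state from the previous state and action, then samples a stochastic state from a Gaussian whose parameters depend on that deterministic state.

```python
import math
import random


def init_layer(rng, n_out, n_in, scale=0.1):
    """Return (weights, bias) for a small randomly initialised affine layer."""
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def linear(weights, bias, inputs):
    """Affine map: one output per row of `weights`."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, bias)]


class ToyLatentModel:
    """Hypothetical toy latent transition with two state components."""

    def __init__(self, det_size=4, stoch_size=2, action_size=1, seed=0):
        self.rng = random.Random(seed)
        self.det_size = det_size
        self.stoch_size = stoch_size
        # Untrained weights; a real model would learn these from data.
        self.det_layer = init_layer(
            self.rng, det_size, det_size + stoch_size + action_size)
        self.mean_layer = init_layer(self.rng, stoch_size, det_size)
        self.log_std_layer = init_layer(self.rng, stoch_size, det_size)

    def step(self, det, stoch, action):
        # Deterministic path: a fixed function of the previous state and action.
        new_det = [math.tanh(v)
                   for v in linear(*self.det_layer, det + stoch + action)]
        # Stochastic path: sample from a Gaussian conditioned on new_det.
        mean = linear(*self.mean_layer, new_det)
        std = [math.exp(v) for v in linear(*self.log_std_layer, new_det)]
        new_stoch = [self.rng.gauss(m, s) for m, s in zip(mean, std)]
        return new_det, new_stoch

    def rollout(self, actions):
        """Roll the latent state forward through a list of action vectors."""
        det = [0.0] * self.det_size
        stoch = [0.0] * self.stoch_size
        states = []
        for action in actions:
            det, stoch = self.step(det, stoch, action)
            states.append((det, stoch))
        return states


if __name__ == "__main__":
    model = ToyLatentModel(seed=0)
    for det, stoch in model.rollout([[0.5]] * 3):
        print([round(v, 3) for v in det], [round(v, 3) for v in stoch])
```

Because the rollout never touches pixels, a planner could evaluate many candidate action sequences cheaply in this latent space. The sketch leaves out the encoder, the reward model, and training.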
