Seminar: Path Integral Reinforcement Learning
| Speaker | Bert Kappen |
|---|---|
| Affiliation | Radboud University Nijmegen |
| Date | Wednesday, 11 Apr 2012 |
| Time | 16:00 - 17:00 |
| Location | Basement Seminar Room B10, Alexandra House, 17 Queen Square, WC1N 3AR |
| Event series | Gatsby Unit Seminars |
| Description |
Stochastic optimal control theory provides a principled answer to the problem of computing an optimal sequence of actions to reach a goal in the presence of uncertainty. The solution is based on dynamic programming and known as the Bellman equation. However, the actual computation is typically very costly and scales exponentially in the dimension of the problem. Recently, it was shown that a quite large class of non-linear control problems could be solved using an alternative approach using a diffusion process. The optimal control can be represented as a path integral, an expectation over future trajectories. This solution can be computed much more efficiently, using MCMC. In this talk, I will introduce the theory and give some examples. In addition, I show how the path integral control formalism can be used for learning the optimal control of a deterministic plant. I will illustrate the approach with some examples. |
| iCalendar | csml_id_22.ics |
