Orateur
Description
The characterization of optimal feedback for nonlinear dynamical systems involves solving a Hamilton-Jacobi-Bellman equation. This is a nonlinear first order hyperbolic equation in a space whose dimension is that of the state-space of the underlying system. Thus solving this system in practice one is confronted with a curse of dimensionality.
In this talk we present techniques which either, in part, circumvent the necessity of solving the HJB equations directly, or use system reduction techniques to alleviate the difficulty associated with high dimensions. Specifically, we briefly describe experience with a succinct use of policy iteration, we introduce a data-driven technique which exploits higher order information, and we explain the 'Averaged Feedback Learning Scheme'. - Frequently the solution to the HJB equation is semi-concave. This motivates the last point in our talk: an approximation scheme which is structure preserving.
This is joint work with
Behzad Azmi, Uni-Konstanz; S. Dolgov, Uni-Bath; Donato Vaquez-Varas, Radon Institute, Austrian Academy of Sciences; and Daniel Walter, Humboldt University Berlin.