Summer school EUR MINT 2024 "Random Matrices and Free Probability"

Name: Summer school EUR MINT 2024 "Random Matrices and Free Probability"
Start: 2024-06-10T09:00:00+02:00
End: 2024-06-18T18:15:00+02:00
Location: Institut de Mathématiques

10–18 juin 2024

Institut de Mathématiques

Fuseau horaire Europe/Paris

Contact

Random matrices and dynamics of optimization in very high dimensions 1/3

17 juin 2024, 11:00

1h 30m

Amphithéâtre Schwartz (Institut de Mathématiques)

Amphithéâtre Schwartz

Institut de Mathématiques

Université Toulouse 3 Paul Sabatier 118 Route de Narbonne Institut de Mathématiques- Bâtiment 1R3 Toulouse

Gérard Ben Arous

Machine learning and Data science algorithms include the need for efficient optimization of topologically complex random functions in very high dimensions. Surprisingly, simple algorithms like Stochastic Gradient Descent (with small batches) are used very effectively.
I will concentrate on trying to understand why these simple tools can still work in these complex and very over-parametrized regimes.

I will first introduce the whole framework for non-experts, from the structure of the typical tasks to the natural structures of simple neural nets used in standard contexts. l will then cover briefly the classical and usual context of SGD in finite dimensions.
I will then survey recent work with Reza Gheissari (Northwestern), Aukosh Jagannath (Waterloo) giving a general view for the existence of projected “effective dynamics" for "summary statistics” in much smaller dimensions, which still rule the performance of very high dimensional systems, as well . These effective dynamics define a dynamical system in finite dimensions which may be quite complex, and rules the performance of the learning algorithm.
The next step will be to understand how the system finds these low dimensional “summary statistics”.
RMT enters the game for this next step (which is done in the next works with the same authors and with Jiaoyang Huang (Wharton, U-Penn)).
This is based on a dynamical spectral transition: along the trajectory of the optimization path, the Gram matrix or the Hessian matrix develop BBP outliers which carry these effective dynamics.
I will illustrate the use of this point of view on a few central examples of ML: multilayer neural nets for classification (of Gaussian mixtures), and the XOR examples, for instance.

Aucun document.

Summer school EUR MINT 2024 "Random Matrices and Free Probability"

Contact

Random matrices and dynamics of optimization in very high dimensions 1/3

Amphithéâtre Schwartz

Institut de Mathématiques

Orateur

Description

Documents de présentation

Sous-contributions