5 septembre 2022 à 9 décembre 2022
IHP
Fuseau horaire Europe/Paris
Financial support for the participation to the quarter is now closed

Peter Bartlett - The Dynamics of Sharpness-Aware Minimization.

6 oct. 2022, 15:30
1h
Amphitheater Hermite, IHP

Amphitheater Hermite, IHP

Description

Optimization methodology has been observed to affect statistical performance in
high-dimensional prediction problems, and there has been considerable effort devoted
to understanding the behavior of optimization methods and the nature of solutions
that they find. We consider Sharpness-Aware Minimization (SAM), a gradient-based
optimization method that has exhibited performance improvements over gradient de-
scent on image and language prediction problems using deep networks. We show that when SAM is applied with a convex quadratic objective, for most random initializa-
tions it converges to oscillating between either side of the minimum in the direction
with the largest curvature, and we provide bounds on the rate of convergence. In
the non-quadratic case, we show that such oscillations encourage drift toward wider
minima by effectively performing gradient descent, on a slower time scale, on the
spectral norm of the Hessian. (Based on joint work with Olivier Bousquet and Phil
Long)

Documents de présentation

Aucun document.