Geometry and Statistics in Data Sciences, Paris

Name: Geometry and Statistics in Data Sciences, Paris
Start: 2022-09-05T08:30:00+02:00
End: 2022-12-09T18:30:00+01:00
Location: IHP

5 septembre 2022 à 9 décembre 2022

IHP

Fuseau horaire Europe/Paris

Financial support for the participation to the quarter is now closed

Contact

gesda2022@ihp.fr

Adeline Fermanian- Scaling ResNets in the Large-depth Regime

3 oct. 2022, 15:00

Amphitheater Hermite, IHP

Deep ResNets are recognized for achieving state-of-the-art results in complex machine learning tasks. However, the remarkable performance of these architectures relies on a training procedure that needs to be carefully crafted to avoid vanishing or exploding gradients, particularly as the depth L increases. No consensus has been reached on how to mitigate this issue, although a widely discussed strategy consists in scaling the output of each layer by a factor \alpha_L. We show in a probabilistic setting that with standard i.i.d. initializations, the only non-trivial dynamics is for \alpha_L = 1/ \sqrt{L} (other choices lead either to explosion or to identity mapping). This scaling factor corresponds in the continuous-time limit to a neural stochastic differential equation, contrarily to a widespread interpretation that deep ResNets are discretizations of neural ordinary differential equations. By contrast, in the latter regime, stability is obtained with specific correlated initializations and \alpha_L=1/L. Our analysis suggests a strong interplay between scaling and regularity of the weights as a function of the layer index. Finally, in a series of experiments, we exhibit a continuous range of regimes driven by these two parameters, which jointly impact performance before and after training.

Aucun document.

Geometry and Statistics in Data Sciences, Paris

Contact

Adeline Fermanian- Scaling ResNets in the Large-depth Regime

Amphitheater Hermite, IHP

Description

Documents de présentation

Choisissez le fuseau horaire

Geometry and Statistics in Data Sciences, Paris

Contact

Description

Documents de présentation