17–20 Oct 2023
LAAS
Timezone: Europe/Paris

Clustering in large language models: an interacting particle systems perspective

20 Oct 2023, 09:00
1h
Conference Room (LAAS)

Speaker

Borjan Geshkovski

Description

With remarkable empirical success, Transformers enable large language models to compute succinct representations of data using the self-attention mechanism. We model these architectures as interacting particle systems in the spirit of models in collective behaviour and opinion dynamics, allowing us to show the appearance of various clustering/coagulation phenomena. Associated control problems will also be discussed. Based on joint work with Cyril Letrouit, Yury Polyanskiy, and Philippe Rigollet.
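As a rough illustration of the particle viewpoint described in the abstract, the sketch below treats tokens as points on the unit sphere that drift toward a softmax-weighted average of one another. It is not taken from the talk. The identity query, key and value matrices, the sphere constraint, the parameter `beta`, the explicit Euler step, and all function names are assumptions made for illustration only.

```python
import numpy as np


def simulate_attention_particles(n=8, d=3, beta=1.0, dt=0.1, steps=2000, seed=0):
    """Toy self-attention dynamics on the unit sphere (illustrative assumptions).

    Each particle x_i moves along the tangential part of
    sum_j softmax_j(beta * <x_i, x_j>) * x_j.
    Returns the initial and final configurations, each of shape (n, d).
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n, d))
    x /= np.linalg.norm(x, axis=1, keepdims=True)  # start on the sphere
    x0 = x.copy()

    for _ in range(steps):
        logits = beta * (x @ x.T)
        # Row-wise softmax, shifted by the row maximum for numerical stability.
        weights = np.exp(logits - logits.max(axis=1, keepdims=True))
        weights /= weights.sum(axis=1, keepdims=True)

        drift = weights @ x
        # Remove the radial component so the motion stays tangent to the sphere.
        drift -= np.sum(drift * x, axis=1, keepdims=True) * x

        # Explicit Euler step followed by renormalization onto the sphere.
        x = x + dt * drift
        x /= np.linalg.norm(x, axis=1, keepdims=True)

    return x0, x


def mean_pairwise_cosine(x):
    """Average inner product over distinct pairs of unit vectors."""
    n = x.shape[0]
    gram = x @ x.T
    return (gram.sum() - np.trace(gram)) / (n * (n - 1))


if __name__ == "__main__":
    start, end = simulate_attention_particles()
    print("mean pairwise cosine at start:", mean_pairwise_cosine(start))
    print("mean pairwise cosine at end:  ", mean_pairwise_cosine(end))
```

In this toy setting, a mean pairwise cosine that rises toward 1 indicates the particles are collapsing into a cluster, which is the kind of phenomenon the abstract refers to.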

Presentation materials