English Intern
  • none
Institut für Mathematik

Oberseminar "Mathematik des Maschinellen Lernens und Angewandte Analysis" - M.Sc. Albert Alcalde

Gaussian Mean-Field Dynamics of Unnormalized Transformers
Datum: 28.01.2026, 14:15 - 15:15 Uhr
Kategorie: Veranstaltung
Ort: Hubland Nord, Geb. 40, 01.003
Veranstalter: Lehrstuhl für Mathematik III (Maschinelles Lernen)
Vortragende: Albert Alcalde, Universität Erlangen

We study unnormalized Transformers with self-attention and affine feed-forward layers in a continuous-depth, mean-field setting, focusing on a Gaussian regime in which the dynamics reduce to coupled equations for the mean and covariance. This reduction yields a simple and analytically tractable model that connects Transformer dynamics with classical ideas from control theory and stability analysis. We derive general qualitative properties of the resulting system, including basic controllability features, convergence and instability regimes, and the dependence of long-time behavior on parameter choices. Numerical experiments further suggest that Gaussian statistics may persist across several layers in trained encoder Transformers, indicating that this framework can serve as a simplified setting for exploring stability-related phenomena in Transformer models.

Zurück