Passer à la navigation principale Passer à la recherche Passer au contenu principal

Sparse Double Descent in Vision Transformers: Real or Phantom Threat?

  • Institut Polytechnique de Paris

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Vision transformers (ViT) have been of broad interest in recent theoretical and empirical works. They are state-of-the-art thanks to their attention-based approach, which boosts the identification of key features and patterns within images thanks to the capability of avoiding inductive bias, resulting in highly accurate image analysis. Meanwhile, neoteric studies have reported a “sparse double descent” phenomenon that can occur in modern deep-learning models, where extremely over-parametrized models can generalize well. This raises practical questions about the optimal size of the model and the quest over finding the best trade-off between sparsity and performance is launched: are Vision Transformers also prone to sparse double descent? Can we find a way to avoid such a phenomenon? Our work tackles the occurrence of sparse double descent on ViTs. Despite some works that have shown that traditional architectures, like Resnet, are condemned to the sparse double descent phenomenon, for ViTs we observe that an optimally-tuned $$\ell _2$$ regularization relieves such a phenomenon. However, everything comes at a cost: optimal lambda will sacrifice the potential compression of the ViT.

langue originaleAnglais
titreImage Analysis and Processing – ICIAP 2023 - 22nd International Conference, ICIAP 2023, Proceedings
rédacteurs en chefGian Luca Foresti, Andrea Fusiello, Edwin Hancock
EditeurSpringer Science and Business Media Deutschland GmbH
Pages490-502
Nombre de pages13
ISBN (imprimé)9783031431524
Les DOIs
étatPublié - 1 janv. 2023
Evénement22nd International Conference on Image Analysis and Processing, ICIAP 2023 - Udine, Italie
Durée: 11 sept. 202315 sept. 2023

Série de publications

NomLecture Notes in Computer Science
Volume14234 LNCS
ISSN (imprimé)0302-9743
ISSN (Electronique)1611-3349

Une conférence

Une conférence22nd International Conference on Image Analysis and Processing, ICIAP 2023
Pays/TerritoireItalie
La villeUdine
période11/09/2315/09/23

Empreinte digitale

Examiner les sujets de recherche de « Sparse Double Descent in Vision Transformers: Real or Phantom Threat? ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation