From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport

  • Quentin Bouniot
  • , Ievgen Redko
  • , Anton Mallasto
  • , Charlotte Laclau
  • , Oliver Struckmeier
  • , Karol Arndt
  • , Markus Heinonen
  • , Ville Kyrki
  • , Samuel Kaski

Research output: Contribution to journalConference articlepeer-review

Abstract

In the last decade, we have witnessed the introduction of several novel deep neural network (DNN) architectures exhibiting ever-increasing performance across diverse tasks. Explaining the upward trend of their performance, however, remains difficult as different DNN architectures of comparable depth and width - common factors associated with their expressive power - may exhibit a drastically different performance even when trained on the same dataset. In this paper, we introduce the concept of the non-linearity signature of DNN, the first theoretically sound solution for approximately measuring the non-linearity of deep neural networks. Built upon a score derived from closed-form optimal transport mappings, this signature provides a better understanding of the inner workings of a wide range of DNN architectures and learning paradigms, with a particular emphasis on the computer vision task. We provide extensive experimental results that highlight the practical usefulness of the proposed non-linearity signature and its potential for long-reaching implications. The code for our work is available at https://github.com/qbouniot/AffScoreDeep.

Original languageEnglish
Pages (from-to)25250-25260
Number of pages11
JournalProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOIs
Publication statusPublished - 1 Jan 2025
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025 - Nashville, United States
Duration: 11 Jun 202515 Jun 2025

Keywords

  • activation functions
  • deep neural networks
  • optimal transport

Fingerprint

Dive into the research topics of 'From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport'. Together they form a unique fingerprint.

Cite this