SeReNe: Sensitivity-Based Regularization of Neurons for Structured Sparsity in Neural Networks

  • Enzo Tartaglione
  • Andrea Bragagnolo
  • Francesco Odierna
  • Attilio Fiandrotti
  • Marco Grangetto

Research output: Contribution to journal › Article › peer-review

Abstract

Deep neural networks include millions of learnable parameters, which makes deploying them on resource-constrained devices problematic. Sensitivity-based regularization of neurons (SeReNe) is a method for learning structured sparse topologies that uses neuron sensitivity as a regularizer. We define the sensitivity of a neuron as the variation of the network output with respect to the variation of the activity of the neuron. The lower the sensitivity of a neuron, the less the network output is perturbed if the neuron output changes. By including the neuron sensitivity in the cost function as a regularization term, we are able to prune neurons with low sensitivity. As entire neurons are pruned rather than single parameters, practical network footprint reduction becomes possible. Experiments on multiple network architectures and datasets yield compression ratios that are competitive with state-of-the-art references.
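
The idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a hypothetical two-layer ReLU network y = W2 · relu(W1 · x + b1), takes the sensitivity of a hidden neuron to be the mean absolute derivative of the outputs with respect to that neuron's pre-activation, and shrinks a neuron's incoming weights in proportion to max(0, 1 − sensitivity). The function names, the averaging, and the values of `lr`, `lam`, and `threshold` are all assumptions made for illustration, and the task-loss gradient that would normally accompany the regularizer is omitted.

```python
import numpy as np

def neuron_sensitivity(W1, b1, W2, X):
    """Mean |dy_k/dp_i| over outputs k and batch samples, per hidden neuron i.

    Assumed model: y = W2 @ relu(W1 @ x + b1), so dy_k/dp_i = W2[k, i] * relu'(p_i).
    """
    P = X @ W1.T + b1                      # pre-activations, shape (batch, hidden)
    relu_grad = (P > 0).astype(float)      # relu'(p), shape (batch, hidden)
    out_weight = np.abs(W2).mean(axis=0)   # mean over outputs, shape (hidden,)
    return (relu_grad * out_weight).mean(axis=0)

def regularization_step(W1, S, lr=0.1, lam=0.5):
    """Shrink each neuron's incoming weights in proportion to its insensitivity."""
    insensitivity = np.clip(1.0 - S, 0.0, None)   # zero for sensitive neurons
    return W1 - lr * lam * insensitivity[:, None] * W1

def prune_neurons(W1, b1, W2, threshold=1e-3):
    """Drop hidden neurons whose incoming weights are all below the threshold."""
    keep = np.abs(W1).max(axis=1) > threshold
    return W1[keep], b1[keep], W2[:, keep]

# Toy network: 2 inputs, 3 hidden neurons, 2 outputs.
# Hidden neuron 1 has zero outgoing weights, so it cannot affect the output.
W1 = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
b1 = np.zeros(3)
W2 = np.array([[2.0, 0.0, 3.0], [2.0, 0.0, 3.0]])
X = np.array([[1.0, 1.0], [2.0, 0.5]])

for _ in range(200):
    S = neuron_sensitivity(W1, b1, W2, X)
    W1 = regularization_step(W1, S)

W1_pruned, b1_pruned, W2_pruned = prune_neurons(W1, b1, W2)
print("sensitivity:", S)
print("shapes after pruning:", W1_pruned.shape, W2_pruned.shape)
```

In this toy setting only the neuron with zero sensitivity is driven toward zero and removed, which shrinks both weight matrices by a full row or column. That is the structured reduction the abstract contrasts with pruning single parameters.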

Original language: English
Pages (from-to): 7237-7250
Number of pages: 14
Journal: IEEE Transactions on Neural Networks and Learning Systems
Volume: 33
Issue number: 12
Publication status: Published - 1 Dec 2022
Externally published: Yes

Keywords

  • Compression
  • deep networks
  • pruning
  • regularization
  • sparse networks
