TOWARDS UNDERSTANDING WHY LABEL SMOOTHING DEGRADES SELECTIVE CLASSIFICATION AND HOW TO FIX IT

Guoxuan Xia, Olivier Laurent, Gianni Franchi, Christos Savvas Bouganis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Label smoothing (LS) is a popular regularisation method for training neural networks as it is effective in improving test accuracy and is simple to implement. “Hard” one-hot labels are “smoothed” by uniformly distributing probability mass to other classes, reducing overfitting. Prior work has suggested that in some cases LS can degrade selective classification (SC) - where the aim is to reject misclassifications using a model's uncertainty. In this work, we first demonstrate empirically across an extended range of large-scale tasks and architectures that LS consistently degrades SC. We then address a gap in existing knowledge, providing an explanation for this behaviour by analysing logit-level gradients: LS degrades the uncertainty rank ordering of correct vs incorrect predictions by suppressing the max logit more when a prediction is likely to be correct, and less when it is likely to be wrong. This elucidates previously reported experimental results where strong classifiers underperform in SC. We then demonstrate the empirical effectiveness of post-hoc logit normalisation for recovering lost SC performance caused by LS. Furthermore, linking back to our gradient analysis, we again provide an explanation for why such normalisation is effective. Project page: https://ensta-u2is-ai.github.io/Understanding-Label-smoothing-Selective-classification/.

Original languageEnglish
Title of host publication13th International Conference on Learning Representations, ICLR 2025
PublisherInternational Conference on Learning Representations, ICLR
Pages11566-11599
Number of pages34
ISBN (Electronic)9798331320850
Publication statusPublished - 1 Jan 2025
Externally publishedYes
Event13th International Conference on Learning Representations, ICLR 2025 - Singapore, Singapore
Duration: 24 Apr 202528 Apr 2025

Publication series

Name13th International Conference on Learning Representations, ICLR 2025

Conference

Conference13th International Conference on Learning Representations, ICLR 2025
Country/TerritorySingapore
CitySingapore
Period24/04/2528/04/25

Fingerprint

Dive into the research topics of 'TOWARDS UNDERSTANDING WHY LABEL SMOOTHING DEGRADES SELECTIVE CLASSIFICATION AND HOW TO FIX IT'. Together they form a unique fingerprint.

Cite this