Passer à la navigation principale Passer à la recherche Passer au contenu principal

Say My Name: a Model’s Bias Discovery Framework

  • Massimiliano Ciranni
  • , Luca Molinaro
  • , Carlo Alberto Barbano
  • , Attilio Fiandrotti
  • , Vittorio Murino
  • , Vito Paolo Pastore
  • , Enzo Tartaglione

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

Due to the broad applicability of deep learning to downstream tasks and end-to-end training capabilities in the last few years, increasingly more concerns about potential biases to specific, non-representative patterns have been raised. Many works focusing on unsupervised debiasing leverage the tendency of deep models to learn “easier” samples, for example, by clustering the latent space to obtain bias pseudo-labels. However, their interpretation is not trivial as it does not provide semantic information about the bias features. To address this issue, we introduce “Say My Name” (SaMyNa), a tool to identify semantically potential biases within deep models. Unlike existing methods, our approach focuses on concepts learned by the model, enhancing explainability through a text-based pipeline. Applicable during either training or post-hoc validation, our method can disentangle task-related information and propose itself as a tool to discover biases. Evaluation on typical benchmarks demonstrates its effectiveness in detecting biases and even disclaiming them. When sided with a traditional debiasing approach for bias mitigation, it can achieve state-of-the-art performance while having the advantage of associating a semantic meaning with the discovered bias. The code is available at https://github.com/SayMyName-BiasNaming/samyna-tmlr.

langue originaleAnglais
journalTransactions on Machine Learning Research
Volume2025-November
étatPublié - 1 janv. 2025

Empreinte digitale

Examiner les sujets de recherche de « Say My Name: a Model’s Bias Discovery Framework ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation