KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training

  • Truong Thao Nguyen
  • , Balazs Gerofi
  • , Edgar Josafat Martinez-Noriega
  • , François Trahay
  • , Mohamed Wahib

Research output: Contribution to journalConference articlepeer-review

Abstract

This paper proposes a method for hiding the least-important samples during the training of deep neural networks to increase efficiency, i.e., to reduce the cost of training. Using information about the loss and prediction confidence during training, we adaptively find samples to exclude in a given epoch based on their contribution to the overall learning process, without significantly degrading accuracy. We explore the converge properties when accounting for the reduction in the number of SGD updates. Empirical results on various large-scale datasets and models used directly in image classification and segmentation show that while the with-replacement importance sampling algorithm performs poorly on large datasets, our method can reduce total training time by up to 22% impacting accuracy only by 0.4% compared to the baseline. Code available at https://github.com/TruongThaoNguyen/kakurenbo.

Original languageEnglish
JournalAdvances in Neural Information Processing Systems
Volume36
Publication statusPublished - 1 Jan 2023
Event37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, United States
Duration: 10 Dec 202316 Dec 2023

Fingerprint

Dive into the research topics of 'KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training'. Together they form a unique fingerprint.

Cite this