TY - GEN
T1 - THE RISE OF THE LOTTERY HEROES
T2 - 29th IEEE International Conference on Image Processing, ICIP 2022
AU - Tartaglione, Enzo
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - Recent advances in deep learning optimization showed that just a subset of parameters are really necessary to successfully train a model. Potentially, such a discovery has broad impact from the theory to application; however, it is known that finding these trainable sub-network is a typically costly process. This inhibits practical applications: can the learned sub-graph structures in deep learning models be found at training time? In this work we explore such a possibility, observing and motivating why common approaches typically fail in the extreme scenarios of interest, and proposing an approach which potentially enables training with reduced computational effort. The experiments on either challenging architectures and datasets suggest the algorithmic accessibility over such a computational gain, and in particular a trade-off between accuracy achieved and training complexity deployed emerges.
AB - Recent advances in deep learning optimization showed that just a subset of parameters are really necessary to successfully train a model. Potentially, such a discovery has broad impact from the theory to application; however, it is known that finding these trainable sub-network is a typically costly process. This inhibits practical applications: can the learned sub-graph structures in deep learning models be found at training time? In this work we explore such a possibility, observing and motivating why common approaches typically fail in the extreme scenarios of interest, and proposing an approach which potentially enables training with reduced computational effort. The experiments on either challenging architectures and datasets suggest the algorithmic accessibility over such a computational gain, and in particular a trade-off between accuracy achieved and training complexity deployed emerges.
KW - The lottery ticket hypothesis
KW - computational complexity
KW - deep learning
KW - pruning
UR - https://www.scopus.com/pages/publications/85146643227
U2 - 10.1109/ICIP46576.2022.9897223
DO - 10.1109/ICIP46576.2022.9897223
M3 - Conference contribution
AN - SCOPUS:85146643227
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 2361
EP - 2365
BT - 2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings
PB - IEEE Computer Society
Y2 - 16 October 2022 through 19 October 2022
ER -