TY - GEN
T1 - Tied spatial transformer networks for digit recognition
AU - Cîrstea, Bogdan Ionuţ
AU - Likforman-Sulem, Laurence
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/7/2
Y1 - 2016/7/2
N2 - This paper reports a new approach based on convolutional neural networks (CNNs), which uses spatial transformer networks (STNs). The approach, referred to as Tied Spatial Transformer Networks (TSTNs), consists of training a system which combines a localization CNN and a classification CNN whose weights are shared. The localization CNN is used for predicting an affine transform for the input image, which is then processed according to the predicted parameters and passed through the classification CNN. We have conducted initial experiments on the cluttered MNIST dataset of noisy digits, comparing the TSTN and STN with identical configurations of trainable parameters, but untied, as well as the classification CNN only, applied to the unprocessed images. In all these cases, we obtain better results using the TSTN. We conjecture that the TSTN provides a regularization effect, as compared to untied STNs. Further experiments seem to support this hypothesis.
AB - This paper reports a new approach based on convolutional neural networks (CNNs), which uses spatial transformer networks (STNs). The approach, referred to as Tied Spatial Transformer Networks (TSTNs), consists of training a system which combines a localization CNN and a classification CNN whose weights are shared. The localization CNN is used for predicting an affine transform for the input image, which is then processed according to the predicted parameters and passed through the classification CNN. We have conducted initial experiments on the cluttered MNIST dataset of noisy digits, comparing the TSTN and STN with identical configurations of trainable parameters, but untied, as well as the classification CNN only, applied to the unprocessed images. In all these cases, we obtain better results using the TSTN. We conjecture that the TSTN provides a regularization effect, as compared to untied STNs. Further experiments seem to support this hypothesis.
KW - Character recognition
KW - Convolutional neural network
KW - Deep learning
KW - Spatial transformer network
U2 - 10.1109/ICFHR.2016.0102
DO - 10.1109/ICFHR.2016.0102
M3 - Conference contribution
AN - SCOPUS:85012899491
T3 - Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR
SP - 524
EP - 529
BT - Proceedings - 2016 15th International Conference on Frontiers in Handwriting Recognition, ICFHR 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Conference on Frontiers in Handwriting Recognition, ICFHR 2016
Y2 - 23 October 2016 through 26 October 2016
ER -