TY - JOUR
T1 - Spatially-Consistent Feature Matching and Learning for Heritage Image Analysis
AU - Shen, Xi
AU - Champenois, Robin
AU - Ginosar, Shiry
AU - Pastrolin, Ilaria
AU - Rousselot, Morgane
AU - Bounou, Oumayma
AU - Monnier, Tom
AU - Gidaris, Spyros
AU - Bougard, François
AU - Raverdy, Pierre Guillaume
AU - Limon, Marie Françoise
AU - Bénévent, Christine
AU - Smith, Marc
AU - Poncet, Olivier
AU - Bender, K.
AU - Joyeux-Prunel, Béatrice
AU - Honig, Elizabeth
AU - Efros, Alexei A.
AU - Aubry, Mathieu
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2022/5/1
Y1 - 2022/5/1
N2 - Progress in the digitization of cultural assets leads to online databases that become too large for a human to analyze. Moreover, some analyses might be challenging, even for experts. In this paper, we explore two applications of computer vision to analyze historical data: watermark recognition and one-shot repeated pattern detection in artwork collections. Both problems present computer vision challenges which we believe to be representative of the ones encountered in cultural heritage applications: limited supervision is available, the tasks are fine-grained recognition, and the data comes in several different modalities. Both applications are also highly practical, as recognizing watermarks makes it possible to date and locate documents, while detecting repeated patterns allows exploring visual links between artworks. We demonstrate on both tasks the benefits of relying on deep mid-level features. More precisely, we define an image similarity score based on geometric verification of mid-level features and show how spatial consistency can be used to fine-tune out-of-the-box features for the target dataset with weak or no supervision. This paper relates and extends our previous works (Shen et al. in Discovering visual patterns in art collections with spatially-consistent feature learning, 2019; Shen et al. in Large-scale historical watermark recognition dataset and a new consistency-based approach, 2020). Our code and data are available at http://imagine.enpc.fr/~shenx/HisImgAnalysis/.
AB - Progress in the digitization of cultural assets leads to online databases that become too large for a human to analyze. Moreover, some analyses might be challenging, even for experts. In this paper, we explore two applications of computer vision to analyze historical data: watermark recognition and one-shot repeated pattern detection in artwork collections. Both problems present computer vision challenges which we believe to be representative of the ones encountered in cultural heritage applications: limited supervision is available, the tasks are fine-grained recognition, and the data comes in several different modalities. Both applications are also highly practical, as recognizing watermarks makes it possible to date and locate documents, while detecting repeated patterns allows exploring visual links between artworks. We demonstrate on both tasks the benefits of relying on deep mid-level features. More precisely, we define an image similarity score based on geometric verification of mid-level features and show how spatial consistency can be used to fine-tune out-of-the-box features for the target dataset with weak or no supervision. This paper relates and extends our previous works (Shen et al. in Discovering visual patterns in art collections with spatially-consistent feature learning, 2019; Shen et al. in Large-scale historical watermark recognition dataset and a new consistency-based approach, 2020). Our code and data are available at http://imagine.enpc.fr/~shenx/HisImgAnalysis/.
KW - Artwork analysis
KW - Feature learning
KW - Self-supervised learning
KW - Watermark recognition
UR - https://www.scopus.com/pages/publications/85127255514
U2 - 10.1007/s11263-022-01576-x
DO - 10.1007/s11263-022-01576-x
M3 - Article
AN - SCOPUS:85127255514
SN - 0920-5691
VL - 130
SP - 1325
EP - 1339
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
IS - 5
ER -