TY - GEN
T1 - Historical Astronomical Diagrams Decomposition in Geometric Primitives
AU - Kalleli, Syrine
AU - Trigg, Scott
AU - Albouy, Ségolène
AU - Husson, Matthieu
AU - Aubry, Mathieu
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Automatically extracting the geometric content from the hundreds of thousands of diagrams drawn in historical manuscripts would enable historians to study the diffusion of astronomical knowledge on a global scale. However, state-of-the-art vectorization methods, often designed to tackle modern data, are not adapted to the complexity and diversity of historical astronomical diagrams. Our contribution is thus twofold. First, we introduce a unique dataset of 303 astronomical diagrams from diverse traditions, ranging from the XIIth to the XVIIIth century, annotated with more than 3000 line segments, circles and arcs. Second, we develop a model that builds on DINO-DETR to enable the prediction of multiple geometric primitives. We show that it can be trained solely on synthetic data and accurately predict primitives on our challenging dataset. Our approach widely improves over the LETR baseline, which is restricted to lines, by introducing a meaningful parametrization for multiple primitives, jointly training for detection and parameter refinement, using deformable attention and training on rich synthetic data. Our dataset and code are available on our webpage: http://imagine.enpc.fr/~kallelis/icdar2024/.
AB - Automatically extracting the geometric content from the hundreds of thousands of diagrams drawn in historical manuscripts would enable historians to study the diffusion of astronomical knowledge on a global scale. However, state-of-the-art vectorization methods, often designed to tackle modern data, are not adapted to the complexity and diversity of historical astronomical diagrams. Our contribution is thus twofold. First, we introduce a unique dataset of 303 astronomical diagrams from diverse traditions, ranging from the XIIth to the XVIIIth century, annotated with more than 3000 line segments, circles and arcs. Second, we develop a model that builds on DINO-DETR to enable the prediction of multiple geometric primitives. We show that it can be trained solely on synthetic data and accurately predict primitives on our challenging dataset. Our approach widely improves over the LETR baseline, which is restricted to lines, by introducing a meaningful parametrization for multiple primitives, jointly training for detection and parameter refinement, using deformable attention and training on rich synthetic data. Our dataset and code are available on our webpage: http://imagine.enpc.fr/~kallelis/icdar2024/.
KW - Historical diagrams
KW - Transformers
KW - Vectorization
UR - https://www.scopus.com/pages/publications/85204572798
U2 - 10.1007/978-3-031-70543-4_7
DO - 10.1007/978-3-031-70543-4_7
M3 - Conference contribution
AN - SCOPUS:85204572798
SN - 9783031705427
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 108
EP - 125
BT - Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Proceedings
A2 - Barney Smith, Elisa H.
A2 - Liwicki, Marcus
A2 - Peng, Liangrui
PB - Springer Science and Business Media Deutschland GmbH
T2 - 18th International Conference on Document Analysis and Recognition, ICDAR 2024
Y2 - 30 August 2024 through 4 September 2024
ER -