Historical Astronomical Diagrams Decomposition in Geometric Primitives

  • Syrine Kalleli
  • , Scott Trigg
  • , Ségolène Albouy
  • , Matthieu Husson
  • , Mathieu Aubry

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Automatically extracting the geometric content from the hundreds of thousands of diagrams drawn in historical manuscripts would enable historians to study the diffusion of astronomical knowledge on a global scale. However, state-of-the-art vectorization methods, often designed to tackle modern data, are not adapted to the complexity and diversity of historical astronomical diagrams. Our contribution is thus twofold. First, we introduce a unique dataset of 303 astronomical diagrams from diverse traditions, ranging from the XIIth to the XVIIIth century, annotated with more than 3000 line segments, circles and arcs. Second, we develop a model that builds on DINO-DETR to enable the prediction of multiple geometric primitives. We show that it can be trained solely on synthetic data and accurately predict primitives on our challenging dataset. Our approach widely improves over the LETR baseline, which is restricted to lines, by introducing a meaningful parametrization for multiple primitives, jointly training for detection and parameter refinement, using deformable attention and training on rich synthetic data. Our dataset and code are available on our webpage: http://imagine.enpc.fr/~kallelis/icdar2024/.

Original languageEnglish
Title of host publicationDocument Analysis and Recognition - ICDAR 2024 - 18th International Conference, Proceedings
EditorsElisa H. Barney Smith, Marcus Liwicki, Liangrui Peng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages108-125
Number of pages18
ISBN (Print)9783031705427
DOIs
Publication statusPublished - 1 Jan 2024
Externally publishedYes
Event18th International Conference on Document Analysis and Recognition, ICDAR 2024 - Athens, Greece
Duration: 30 Aug 20244 Sept 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14806 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference18th International Conference on Document Analysis and Recognition, ICDAR 2024
Country/TerritoryGreece
CityAthens
Period30/08/244/09/24

Keywords

  • Historical diagrams
  • Transformers
  • Vectorization

Fingerprint

Dive into the research topics of 'Historical Astronomical Diagrams Decomposition in Geometric Primitives'. Together they form a unique fingerprint.

Cite this