TY - GEN
T1 - Improving Generative Data Augmentation with Prior-Knowledge for Dysplasia Grading of Colorectal Polyps
AU - Craparotta, Roberto
AU - Ivanov, Desislav
AU - Barbano, Carlo Alberto
AU - Tartaglione, Enzo
AU - Gambella, Alessandro
AU - Cavallo, Luca
AU - Cassoni, Paola
AU - Bertero, Luca
AU - Grangetto, Marco
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026/1/1
Y1 - 2026/1/1
N2 - Generative models (GANs or Diffusion Models) have achieved state-of-the-art performance in image synthesis in various domains, including the medical field. Many works have shown that it is possible to fully train deep models (e.g. classifiers) solely based on synthesized data and that augmenting real data with synthesized samples can lead to superior discriminative power. However, large datasets are needed to train realistic generative models. On the other hand, this can be an issue in certain domains, such as medical imaging, where datasets are typically smaller and more costly to annotate. Small sample size poses a fundamental problem in the applicability of generative models for data augmentation, as a generative model will not necessarily approximate the real data distribution better than a simple classifier in a low-data regime. To tackle this issue, we introduce prior information in the generative process to compensate for the lack of data, unlocking generative augmentation in low-data settings. In this work, we focus on computational pathology, specifically on the sensitive topic of the classification of colorectal polyp dysplasia. To guide the generative process, we take advantage of medical knowledge on tissue morphology taken from the World Health Organization (WHO) guidelines for the classification of dysplasia. By incorporating our proposed generative pipeline into a contrastive learning framework, we achieve state-of-the-art results in the detection of high-grade dysplasia on the UnitoPatho dataset.
AB - Generative models (GANs or Diffusion Models) have achieved state-of-the-art performance in image synthesis in various domains, including the medical field. Many works have shown that it is possible to fully train deep models (e.g. classifiers) solely based on synthesized data and that augmenting real data with synthesized samples can lead to superior discriminative power. However, large datasets are needed to train realistic generative models. On the other hand, this can be an issue in certain domains, such as medical imaging, where datasets are typically smaller and more costly to annotate. Small sample size poses a fundamental problem in the applicability of generative models for data augmentation, as a generative model will not necessarily approximate the real data distribution better than a simple classifier in a low-data regime. To tackle this issue, we introduce prior information in the generative process to compensate for the lack of data, unlocking generative augmentation in low-data settings. In this work, we focus on computational pathology, specifically on the sensitive topic of the classification of colorectal polyp dysplasia. To guide the generative process, we take advantage of medical knowledge on tissue morphology taken from the World Health Organization (WHO) guidelines for the classification of dysplasia. By incorporating our proposed generative pipeline into a contrastive learning framework, we achieve state-of-the-art results in the detection of high-grade dysplasia on the UnitoPatho dataset.
KW - Computational pathology
KW - Contrastive learning
KW - Generative models
UR - https://www.scopus.com/pages/publications/105027544407
U2 - 10.1007/978-3-032-10185-3_45
DO - 10.1007/978-3-032-10185-3_45
M3 - Conference contribution
AN - SCOPUS:105027544407
SN - 9783032101846
T3 - Lecture Notes in Computer Science
SP - 571
EP - 583
BT - Image Analysis and Processing – ICIAP 2025 - 23rd International Conference, Proceedings
A2 - Rodolà, Emanuele
A2 - Galasso, Fabio
A2 - Masi, Iacopo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 23rd International Conference on Image Analysis and Processing, ICIAP 2025
Y2 - 15 September 2025 through 19 September 2025
ER -