Improving Generative Data Augmentation with Prior-Knowledge for Dysplasia Grading of Colorectal Polyps

  • Roberto Craparotta
  • , Desislav Ivanov
  • , Carlo Alberto Barbano
  • , Enzo Tartaglione
  • , Alessandro Gambella
  • , Luca Cavallo
  • , Paola Cassoni
  • , Luca Bertero
  • , Marco Grangetto

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Generative models (GANs or Diffusion Models) have achieved state-of-the-art performance in image synthesis in various domains, including the medical field. Many works have shown that it is possible to fully train deep models (e.g. classifiers) solely based on synthesized data and that augmenting real data with synthesized samples can lead to superior discriminative power. However, large datasets are needed to train realistic generative models. On the other hand, this can be an issue in certain domains, such as medical imaging, where datasets are typically smaller and more costly to annotate. Small sample size poses a fundamental problem in the applicability of generative models for data augmentation, as a generative model will not necessarily approximate the real data distribution better than a simple classifier in a low-data regime. To tackle this issue, we introduce prior information in the generative process to compensate for the lack of data, unlocking generative augmentation in low-data settings. In this work, we focus on computational pathology, specifically on the sensitive topic of the classification of colorectal polyp dysplasia. To guide the generative process, we take advantage of medical knowledge on tissue morphology taken from the World Health Organization (WHO) guidelines for the classification of dysplasia. By incorporating our proposed generative pipeline into a contrastive learning framework, we achieve state-of-the-art results in the detection of high-grade dysplasia on the UnitoPatho dataset.

Original languageEnglish
Title of host publicationImage Analysis and Processing – ICIAP 2025 - 23rd International Conference, Proceedings
EditorsEmanuele Rodolà, Fabio Galasso, Iacopo Masi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages571-583
Number of pages13
ISBN (Print)9783032101846
DOIs
Publication statusPublished - 1 Jan 2026
Event23rd International Conference on Image Analysis and Processing, ICIAP 2025 - Rome, Italy
Duration: 15 Sept 202519 Sept 2025

Publication series

NameLecture Notes in Computer Science
Volume16167 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference23rd International Conference on Image Analysis and Processing, ICIAP 2025
Country/TerritoryItaly
CityRome
Period15/09/2519/09/25

Keywords

  • Computational pathology
  • Contrastive learning
  • Generative models

Fingerprint

Dive into the research topics of 'Improving Generative Data Augmentation with Prior-Knowledge for Dysplasia Grading of Colorectal Polyps'. Together they form a unique fingerprint.

Cite this