Accurate Boundary Alignment and Realism Enhancement for Colonoscopic Polyp Image-Mask Pair Generation
Sep 20, 2025·
,

,,
Riyu Qiu
Kun Xia
Feng GAO
Shu-Ting YANG
Du CAI
Jiacheng Wang
Yinran Chen
Corresponding Author
,Liansheng Wang
Corresponding Author
·
0 min readAbstract
Polyp segmentation is the foundation of colonoscopic lesion screening, diagnosis, and therapy. However, the data size of images and annotations is limited. The latent diffusion model (LDM) has emerged as a powerful tool in synthesizing high-quality medical images with low computational costs. However, the challenges of boundary-aligned image-mask pairs and image realism remain unresolved, showing that (i) the spatial relationship between the boundaries is easily distorted in the latent space; (ii) the diversity of colors, shapes, and textures, along with low boundary contrast and textures similar to surrounding tissue, makes boundary distinction of the polyps difficult. This paper proposes Polyp-LDM that encodes polyps and masks into the same latent space via a unified variational autoencoder (VAE) to align their boundaries. Furthermore, Polyp-LDM refines texture and lighting while preserving the structure by fine-tuning the VAE decoder with data augmentation and applying the style cloning module to enhance image realism. Quantitative evaluations and user preference study demonstrate that our method outperforms existing methods in image-mask pair generation. Moreover, segmentation models trained with augmented data generated by polyp-LDM achieve the best performance on three public polyp datasets.
Type
Publication
International Conference on Medical Image Computing and Computer-Assisted Intervention

Authors
Professor
My research leverages AI and big data to improve diagnostics, prognostics, and ultimately, outcomes in cancer and other biomedical fields.

Authors
Physician
An endocrinologist passionate about artificial intelligence and diabetes, focusing on applying machine learning and bioinformatics to diabetes heterogeneity research.

Authors
Postdoc
I focus on leveraging explainable AI and large foundation models to advance medical imaging and digital pathology in colorectal cancer research.