DiffSeg: Diffusion-Based Weakly Supervised Segmentation for Fibrosis

Enhancing weakly supervised segmentation via controllable image generation

By Gavin Yue in AI Computer Vision Medical

Figure 1. Pulmonary fibrosis illustartion. Fibrosis can cause irreversible lung damage and is associated with increased mortality.

Publication & Presentation

This project was published at IEEE ISBI 2025 and featured as an invited presentation in April 2025, Houston, TX, USA.


Introduction

Fibrotic Lung Disease (FLD) is a life-threatening condition characterised by progressive lung scarring, responsible for 1% of UK deaths in 2021. However, fibrosis appears in irregular and diffuse patterns with poorly defined boundaries, making manual segmentation highly variable and labor-intensive.

To address this, we propose an automated segmentation pipeline that avoids pixel-level annotations and instead relies on image-level binary labels, reducing human annotation effort while maintaining accuracy.

Figure 2. Challenges in Fibrosis Segmentation

Method

We introduce DiffSeg, a novel diffusion-based weakly supervised semantic segmentation framework. Key innovations include:

  • Image-level supervision: Only binary (yes/no) fibrosis labels are used.
  • Controllable image generation: Using a diffusion model, healthy CT slices are transformed into synthetic images with controllable fibrosis severity.
  • Frozen classifier and refinement pipeline: Paired healthy and fibrosis-injected images help guide a classifier to generate accurate pseudo masks.

The pipeline synthesizes diverse training data and leverages pseudo-label refinement for robust segmentation, even in complex pathological patterns.

Figure 3. Workflow Overview

Results

DiffSeg achieves state-of-the-art performance with a Dice score of 61.75% (IQR: 52.37–70.02%), outperforming existing methods:

MethodSupervisionBackboneDice (%)
MedSAM (Fine)Fine-boxViT-B40.17
MedSAM (Single)Single-boxViT-B26.31
DuPLImage-levelViT-B19.45
COINImage-levelC-GAN27.89
DiffSegImage-levelDiffusion61.75

Visual comparisons show that DiffSeg closely resembles ground truth masks with finer details and minimal noise, demonstrating strong generalizability without relying on pixel-wise labels.

Figure 4. Segmentation Results Visulisation

Conclusion

DiffSeg is a robust and efficient framework for weakly supervised segmentation in medical imaging. By leveraging generative diffusion models, it:

  • Reduces the need for expensive expert annotations,
  • Achieves competitive segmentation performance, and
  • Enables fine-grained analysis of complex fibrosis patterns.

This work opens new directions for scalable, label-efficient medical AI systems, particularly where expert annotations are difficult or costly to obtain.


πŸ“„ Full Poster


Project Links:
πŸ“„ Paper on arXiv
πŸ“« Contact: zhilinggavin@gmail.com
πŸ”— LinkedIn

Posted on:
March 1, 2025
Length:
2 minute read, 351 words
Categories:
AI Computer Vision Medical
See Also: