DiffSeg: Diffusion-Based Weakly Supervised Segmentation for Fibrosis
Enhancing weakly supervised segmentation via controllable image generation
By Gavin Yue in AI Computer Vision Medical



Publication & Presentation
This project was published at IEEE ISBI 2025 and featured as an invited presentation in April 2025, Houston, TX, USA.
Introduction
Fibrotic Lung Disease (FLD) is a life-threatening condition characterised by progressive lung scarring, responsible for 1% of UK deaths in 2021. However, fibrosis appears in irregular and diffuse patterns with poorly defined boundaries, making manual segmentation highly variable and labor-intensive.
To address this, we propose an automated segmentation pipeline that avoids pixel-level annotations and instead relies on image-level binary labels, reducing human annotation effort while maintaining accuracy.

Method
We introduce DiffSeg, a novel diffusion-based weakly supervised semantic segmentation framework. Key innovations include:
- Image-level supervision: Only binary (yes/no) fibrosis labels are used.
- Controllable image generation: Using a diffusion model, healthy CT slices are transformed into synthetic images with controllable fibrosis severity.
- Frozen classifier and refinement pipeline: Paired healthy and fibrosis-injected images help guide a classifier to generate accurate pseudo masks.
The pipeline synthesizes diverse training data and leverages pseudo-label refinement for robust segmentation, even in complex pathological patterns.

Results
DiffSeg achieves state-of-the-art performance with a Dice score of 61.75% (IQR: 52.37β70.02%), outperforming existing methods:
| Method | Supervision | Backbone | Dice (%) |
|---|---|---|---|
| MedSAM (Fine) | Fine-box | ViT-B | 40.17 |
| MedSAM (Single) | Single-box | ViT-B | 26.31 |
| DuPL | Image-level | ViT-B | 19.45 |
| COIN | Image-level | C-GAN | 27.89 |
| DiffSeg | Image-level | Diffusion | 61.75 |
Visual comparisons show that DiffSeg closely resembles ground truth masks with finer details and minimal noise, demonstrating strong generalizability without relying on pixel-wise labels.

Conclusion
DiffSeg is a robust and efficient framework for weakly supervised segmentation in medical imaging. By leveraging generative diffusion models, it:
- Reduces the need for expensive expert annotations,
- Achieves competitive segmentation performance, and
- Enables fine-grained analysis of complex fibrosis patterns.
This work opens new directions for scalable, label-efficient medical AI systems, particularly where expert annotations are difficult or costly to obtain.
π Full Poster
Project Links:
π Paper on arXiv
π« Contact: zhilinggavin@gmail.com
π LinkedIn
- Posted on:
- March 1, 2025
- Length:
- 2 minute read, 351 words
- Categories:
- AI Computer Vision Medical
- See Also: