One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts
Paper
•
2312.17183
•
Published
This repo contains the checkpoints for SAT.
We offer SAT-Pro, SAT-Nano (both trained on 72 datasets) and another 5 different variants of SAT-Nano (all trained on 49 datasets):
Check our paper for more details, and github repo for usage instruction.
⚠️ Each model should be used with paired checkpoint and text encoder checkpoint.
In addition, we provide multiple pretrained encoders at ./Pretrain. Enhanced with multi-modal human anatomy knowledge, they significantly boost the segmentation performance and are potentially beneficial for other tasks:
textual_only.pth).multimodal_sat_ds.pth). It can be used to reproduce results in our paper.multimodal_cvpr25.pth). It's explicitly optimized for the challenge.