Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation (SIPE)
The implementation of Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation, Qi Chen, Lingxiao Yang, Jianhuang Lai, and Xiaohua Xie, CVPR 2022.
Weakly Supervised Semantic Segmentation (WSSS) based on image-level labels has attracted much attention due to low annotation costs. Existing methods often rely on Class Activation Mapping (CAM) that measures the correlation between image pixels and classifier weight. However, the classifier focuses only on the discriminative regions while ignoring other useful information in each image, resulting in incomplete localization maps. To address this issue, we propose a Self-supervised Image-specific Prototype Exploration (SIPE) that consists of an Image-specific Prototype Exploration (IPE) and a General-Specific Consistency (GSC) loss. Specifically, IPE tailors prototypes for every image to capture complete regions, formed our Image-Specific CAM (IS-CAM), which is realized by two sequential steps. In addition, GSC is proposed to construct the consistency of general CAM and our specific IS-CAM, which further optimizes the feature representation and empowers a self-correction ability of prototype exploration. Extensive experiments are conducted on PASCAL VOC 2012 and MS COCO 2014 segmentation benchmark and results show our SIPE achieves new state-of-the-art performance using only image-level labels.
- Python >= 3.6.6
- Pytorch >= 1.6.0
- Torchvision
# PASCAL VOC 2012
bash run_voc.sh
# MS COCO 2014
bash run_coco.sh
To train fully supervised segmentation models, we refer to deeplab-pytorch and seamv1.
Dataset | Model | mIoU (Train) | Weight | Training log |
---|---|---|---|---|
PASCAL VOC 2012 | CVPR submit | 58.65 | Download | Logfile |
PASCAL VOC 2012 | This repo | 58.88 | Download | Logfile |
MS COCO 2014 | CVPR submit | 34.41 | Download | Logfile |
MS COCO 2014 | This repo | 35.05 | Download | Logfile |
Dataset | Model | mIoU (Val) | mIoU (Test) | Weight |
---|---|---|---|---|
PASCAL VOC 2012 | WideResNet38 | 68.2 | 69.5 | Download |
PASCAL VOC 2012 | ResNet101 | 68.8 | 69.7 | Download |
MS COCO 2014 | WideResNet38 | 43.6 | - | Download |
MS COCO 2014 | ResNet101 | 40.6 | - | Download |
@InProceedings{Chen_2022_CVPR_SIPE,
author = {Chen, Qi and Yang, Lingxiao and Lai, Jian-Huang and Xie, Xiaohua},
title = {Self-Supervised Image-Specific Prototype Exploration for Weakly Supervised Semantic Segmentation},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
pages = {4288-4298}
}