Masked Cross-image Encoding for Few-shot Segmentation

22 Aug 2023  ·  Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang ·

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images. The key challenge in FSS is to classify the labels of query pixels using class prototypes learned from the few labeled support exemplars. Prior approaches to FSS have typically focused on learning class-wise descriptors independently from support images, thereby ignoring the rich contextual information and mutual dependencies among support-query features. To address this limitation, we propose a joint learning method termed Masked Cross-Image Encoding (MCE), which is designed to capture common visual properties that describe object details and to learn bidirectional inter-image dependencies that enhance feature interaction. MCE is more than a visual representation enrichment module; it also considers cross-image mutual dependencies and implicit guidance. Experiments on FSS benchmarks PASCAL-$5^i$ and COCO-$20^i$ demonstrate the advanced meta-learning ability of the proposed method.

PDF Abstract
No code implementations yet. Submit your code now
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Few-Shot Semantic Segmentation COCO-20i (1-shot) MCE (ResNet-50) Mean IoU 44.22 # 29
Few-Shot Semantic Segmentation COCO-20i (5-shot) MCE (ResNet-50) Mean IoU 51.04 # 24
Few-Shot Semantic Segmentation PASCAL-5i (1-Shot) MCE (ResNet-50) Mean IoU 65.93 # 37
FB-IoU 78.1 # 21
Few-Shot Semantic Segmentation PASCAL-5i (1-Shot) MCE (VGG-16) Mean IoU 62.87 # 64
FB-IoU 74.51 # 37
Few-Shot Semantic Segmentation PASCAL-5i (5-Shot) MCE (ResNet-50) Mean IoU 70.03 # 36
FB-IoU 81.33 # 19
Few-Shot Semantic Segmentation PASCAL-5i (5-Shot) MCE (VGG-16) Mean IoU 68.21 # 50
FB-IoU 78.2 # 32

Methods


No methods listed for this paper. Add relevant methods here