Skip to content

Latest commit

 

History

History
36 lines (27 loc) · 6.51 KB

File metadata and controls

36 lines (27 loc) · 6.51 KB

QueryInst

Instances as Queries

Abstract

We present QueryInst, a new perspective for instance segmentation. QueryInst is a multi-stage end-to-end system that treats instances of interest as learnable queries, enabling query based object detectors, e.g., Sparse R-CNN, to have strong instance segmentation performance. The attributes of instances such as categories, bounding boxes, instance masks, and instance association embeddings are represented by queries in a unified manner. In QueryInst, a query is shared by both detection and segmentation via dynamic convolutions and driven by parallelly-supervised multi-stage learning. We conduct extensive experiments on three challenging benchmarks, i.e., COCO, CityScapes, and YouTube-VIS to evaluate the effectiveness of QueryInst in object detection, instance segmentation, and video instance segmentation tasks. For the first time, we demonstrate that a simple end-to-end query based framework can achieve the state-of-the-art performance in various instance-level recognition tasks.

Results and Models

Model Backbone Style Lr schd Number of Proposals Multi-Scale RandomCrop box AP mask AP Config Download
QueryInst R-50-FPN pytorch 1x 100 False False 42.0 37.5 config model | log
QueryInst R-50-FPN pytorch 3x 100 True False 44.8 39.8 config model | log
QueryInst R-50-FPN pytorch 3x 300 True True 47.5 41.7 config model | log
QueryInst R-101-FPN pytorch 3x 100 True False 46.4 41.0 config model | log
QueryInst R-101-FPN pytorch 3x 300 True True 49.0 42.9 config model | log

Citation

@InProceedings{Fang_2021_ICCV,
    author    = {Fang, Yuxin and Yang, Shusheng and Wang, Xinggang and Li, Yu and Fang, Chen and Shan, Ying and Feng, Bin and Liu, Wenyu},
    title     = {Instances As Queries},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {6910-6919}
}