This is a person detector that is based on Resnet50 backbone with VFNet head for 1344x800 resolution.
Metric | Value |
---|---|
AP @ [ IoU=0.50:0.95 ] | 0.439 (internal test set) |
GFlops | 79318.2158 |
MParams | 55.5570 |
Source framework | PyTorch* |
Average Precision (AP) is defined as an area under the precision/recall curve.
Image, name: image
, shape: 1, 3, 800, 1344
in the format B, C, H, W
, where:
B
- batch sizeC
- number of channelsH
- image heightW
- image width
Expected color order is BGR
.
-
The
boxes
is a blob with the shape100, 5
in the formatN, 5
, whereN
is the number of detected bounding boxes. For each detection, the description has the format: [x_min
,y_min
,x_max
,y_max
,conf
], where:- (
x_min
,y_min
) - coordinates of the top left bounding box corner - (
x_max
,y_max
) - coordinates of the bottom right bounding box corner. conf
- confidence for the predicted class
- (
-
The
labels
is a blob with the shape100
in the formatN
, whereN
is the number of detected bounding boxes. In case of person detection, it is equal to1
for each detected box with person in it and0
for the background.
The model can be used in the following demos provided by the Open Model Zoo to show its capabilities:
[*] Other names and brands may be claimed as the property of others.