This is a person detector that is based on MobileNetV2 backbone with ATSS head for 864x480 resolution.
Metric | Value |
---|---|
AP @ [ IoU=0.50:0.95 ] | 0.408 (internal test set) |
GFlops | 6.519 |
MParams | 2.394 |
Source framework | PyTorch* |
Average Precision (AP) is defined as an area under the precision/recall curve.
Image, name: image
, shape: 1, 3, 480, 864
in the format B, C, H, W
, where:
B
- batch sizeC
- number of channelsH
- image heightW
- image width
Expected color order is BGR
.
-
The
boxes
is a blob with the shape100, 5
in the formatN, 5
, whereN
is the number of detected bounding boxes. For each detection, the description has the format: [x_min
,y_min
,x_max
,y_max
,conf
], where:- (
x_min
,y_min
) - coordinates of the top left bounding box corner - (
x_max
,y_max
) - coordinates of the bottom right bounding box corner. conf
- confidence for the predicted class
- (
-
The
labels
is a blob with the shape100
in the formatN
, whereN
is the number of detected bounding boxes. In case of person detection, it is equal to1
for each detected box with person in it and0
for the background.
The OpenVINO Training Extensions provide a training pipeline, allowing to fine-tune the model on custom dataset.
The model can be used in the following demos provided by the Open Model Zoo to show its capabilities:
[*] Other names and brands may be claimed as the property of others.