inference time on GPU


Yihui He 何宜晖 edited this page Sep 21, 2017 · 2 revisions

what influences inference performance?

Inference time on a GPU depends on several factors:

GPU (GTX TITAN X in the paper)
Batch size (batches smaller than 32 are inefficient)
cuDNN (5 in the paper)
CUDA (8 in the paper)
Caffe
OS (Ubuntu 16 in the paper)

The results may vary slightly from run to run.
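
Since timings fluctuate between runs, a measurement is usually averaged over many iterations after a few untimed warm-up calls. Below is a minimal sketch of such a harness, not the procedure used in the paper. The names `benchmark` and `dummy_forward` are hypothetical; `dummy_forward` is a CPU stand-in for the network's forward pass. When timing real GPU work, the timed callable should wait for the device to finish before returning, because GPU kernels are launched asynchronously.

```python
import statistics
import time


def benchmark(fn, warmup=10, iters=50):
    """Time fn() in milliseconds: run untimed warm-up calls, then return (mean, stdev)."""
    for _ in range(warmup):
        fn()  # warm-up: one-time setup costs are excluded from the samples
    samples_ms = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        samples_ms.append((time.perf_counter() - start) * 1000.0)
    return statistics.mean(samples_ms), statistics.stdev(samples_ms)


def dummy_forward():
    # Hypothetical stand-in for a forward pass; replace with the real call.
    sum(i * i for i in range(10000))


if __name__ == "__main__":
    mean_ms, std_ms = benchmark(dummy_forward)
    print("forward: %.3f ms +/- %.3f ms" % (mean_ms, std_ms))
```

Reporting the standard deviation next to the mean makes it easier to tell whether a difference between two setups is larger than the run-to-run noise.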

the performance of VGG-16 3C 4X

The performance of VGG-16 3C 4X was not tested in our paper. Mr. 장대석 kindly tested it on a GTX 1080 (Linux):

Metric	3C 4X	Original
Forward (FW)	3.6 ms	6.9 ms
Backward (BW)	4.8 ms	5.8 ms
Memory (Mem)	6698 MiB	5156 MiB
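
To read the table as ratios, the short sketch below divides the Original numbers by the 3C 4X numbers. The values are copied from the table; the dictionary and function names are only illustrative.

```python
# Measurements copied from the table (GTX 1080, Linux).
original = {"FW_ms": 6.9, "BW_ms": 5.8, "Mem_MiB": 5156}
model_3c4x = {"FW_ms": 3.6, "BW_ms": 4.8, "Mem_MiB": 6698}


def speedup(key):
    """Original time divided by 3C 4X time (values above 1 mean 3C 4X is faster)."""
    return original[key] / model_3c4x[key]


def memory_ratio():
    """3C 4X memory divided by Original memory (values above 1 mean more memory)."""
    return model_3c4x["Mem_MiB"] / original["Mem_MiB"]


if __name__ == "__main__":
    print("forward speedup:  %.2fx" % speedup("FW_ms"))   # 1.92x
    print("backward speedup: %.2fx" % speedup("BW_ms"))   # 1.21x
    print("memory ratio:     %.2fx" % memory_ratio())     # 1.30x
```

In this single measurement, the forward pass is about 1.9 times faster and the backward pass about 1.2 times faster, while the reported memory is about 1.3 times that of the original model.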