inference time on GPU

Yihui He 何宜晖 edited this page Sep 21, 2017 · 2 revisions

what influences inference performance?

Inference time on GPU depends on several factors:

  • GPU (GTX TITAN X in paper)
  • Batch size (batch sizes below 32 are inefficient)
  • cuDNN (5 in paper)
  • CUDA (8 in paper)
  • Caffe
  • OS (Ubuntu 16 in paper)

Results may vary slightly from run to run.
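Because timings vary between runs, it helps to discard a few warm-up iterations and report the mean and spread over many runs. The following is a minimal sketch of such a harness, not the procedure used in the paper. The names `time_inference` and `dummy_forward` are hypothetical; `dummy_forward` is a CPU placeholder that should be replaced with the real forward pass, and the sketch assumes the call blocks until all GPU work has finished.

```python
import statistics
import time


def time_inference(forward, warmup=5, runs=20):
    """Return (mean_ms, stdev_ms) of `forward()` over `runs` timed calls."""
    # Warm-up calls absorb one-time costs such as lazy initialization.
    for _ in range(warmup):
        forward()

    samples_ms = []
    for _ in range(runs):
        start = time.perf_counter()
        forward()  # assumed to block until the work is complete
        samples_ms.append((time.perf_counter() - start) * 1000.0)

    return statistics.mean(samples_ms), statistics.stdev(samples_ms)


def dummy_forward():
    # Hypothetical placeholder workload; swap in the real forward pass.
    sum(i * i for i in range(10000))


mean_ms, stdev_ms = time_inference(dummy_forward)
print(f"forward: {mean_ms:.3f} ms +/- {stdev_ms:.3f} ms")
```

Reporting the standard deviation alongside the mean makes it easier to tell a real speed-up from run-to-run noise.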

the performance of VGG-16 3C 4X

The performance of VGG-16 3C 4X was not tested in our paper. Mr. 장대석 kindly tested it on a GTX 1080 (Linux):

        3C 4X       Original
FW      3.6 ms      6.9 ms
BW      4.8 ms      5.8 ms
Mem     6698 MiB    5156 MiB
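
To read the numbers above as ratios, the short sketch below divides the Original figures by the 3C 4X figures for FW and BW, and the other way round for Mem. The values are copied from the measurements above; the variable names are only illustrative.

```python
# Measurements from the GTX 1080 test above: (3C 4X, Original).
fw_ms = (3.6, 6.9)
bw_ms = (4.8, 5.8)
mem_mib = (6698, 5156)

# Speed-up of 3C 4X over Original (higher is better).
fw_speedup = fw_ms[1] / fw_ms[0]
bw_speedup = bw_ms[1] / bw_ms[0]

# Memory use of 3C 4X relative to Original (above 1 means more memory).
mem_ratio = mem_mib[0] / mem_mib[1]

print(f"FW speed-up:  {fw_speedup:.2f}x")   # 1.92x
print(f"BW speed-up:  {bw_speedup:.2f}x")   # 1.21x
print(f"Mem ratio:    {mem_ratio:.2f}x")    # 1.30x
```

On this card the FW time is roughly halved, while the reported memory use for 3C 4X is about 30% higher than for Original.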