Results GTX 1080

The mini batch size is 16

The environment for the results listed above is as follows:

The TensorFlow version was very recent, it has to be in order for it to work with CUDA 8.0.

(1) The Torch benchmark is from https://github.com/jcjohnson/cnn-benchmarks (it has an essentially identical setup, VGG-16, GTX 1080, CUDA 8, cuDNN 5, minibatch size 16).

(2) The time is for a complete SGD step including parameter updates, not just the forward+backward time.

Provide feedback