
Benchmark

KUnet contains complete backprop implementations in Matlab, pure C/CUDA, and Julia. I use these (and Caffe) for debugging and benchmarking.

Here are the timing results for my standard backprop test with:

dataset of 76834 instances in 1326 dimensions and 3 classes.
a network with a single hidden layer of 20000 units.
1 epoch with 82 minibatches of 937 instances each.
standard SGD (no momentum) with learningRate=0.01.
a server with Tesla K20m GPU.
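
As a quick sanity check on the setup above, the following sketch works out the sizes it implies. It is only illustrative arithmetic, not part of KUnet; the variable names are made up here, and the bias terms and the forward-pass cost estimate are assumptions, since the setup does not say whether biases are used.

```python
# Benchmark dimensions as stated in the setup list.
n_instances = 76834
n_inputs = 1326
n_hidden = 20000
n_classes = 3
batch_size = 937
n_batches = 82

# The minibatches tile the dataset exactly, so the epoch has no partial batch.
assert n_batches * batch_size == n_instances

# Weights of a fully connected input -> hidden -> output network.
n_weights = n_inputs * n_hidden + n_hidden * n_classes

# Assumption: one bias per hidden unit and per output unit.
n_biases = n_hidden + n_classes
n_params = n_weights + n_biases

# Rough multiply-add count for one forward pass over the whole dataset.
# Assumption: the backward pass and the SGD update are not counted here.
forward_madds_per_epoch = n_weights * n_instances

print("weights:", n_weights)
print("parameters (with assumed biases):", n_params)
print("forward multiply-adds per epoch:", forward_madds_per_epoch)
```

Almost all of the weights sit in the 1326 x 20000 input-to-hidden matrix, so that layer should account for most of the work in each epoch.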

Implementation	Seconds/Epoch
Matlab	7.95
Caffe	6.76
Julia	5.52
CUDA	4.87
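
To make the table easier to compare, here is a small sketch that converts seconds per epoch into instances per second and into relative speed. The helper names `throughput` and `speedup_over` are hypothetical; the timings and the dataset size are copied from this page.

```python
# Dataset size and timings copied from the benchmark description and table.
N_INSTANCES = 76834
SECONDS_PER_EPOCH = {
    "Matlab": 7.95,
    "Caffe": 6.76,
    "Julia": 5.52,
    "CUDA": 4.87,
}


def throughput(name):
    """Training instances processed per second by one implementation."""
    return N_INSTANCES / SECONDS_PER_EPOCH[name]


def speedup_over(baseline, name):
    """How many times faster `name` finishes an epoch than `baseline`."""
    return SECONDS_PER_EPOCH[baseline] / SECONDS_PER_EPOCH[name]


for impl in SECONDS_PER_EPOCH:
    print(f"{impl:6s} {throughput(impl):9.0f} instances/s "
          f"{speedup_over('Matlab', impl):5.2f}x vs Matlab")
```

On these numbers the Julia implementation takes roughly 13% longer per epoch than the pure C/CUDA one. That figure comes from a single epoch on one machine, so treat it as indicative only.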