Maxim Milakov: November 2015

Nov 24, 2015

nnForge v2.0.1

Hi,

I significantly improved performance of CUDA backend recently in nnForge v2.0.1:

Multiple improvements to reduce total buffer sizes, allows running larger chunks (3x for ImageNet):

Taking buffer sizes into account when coloring graph
Maxout, ReLU, and MaxSubsampling layers consume much less memory in CUDA backend
Action graph is optimized to exclude unnecessary concurrency - taking into account device width here

Migrated to cuDNN v3
Reusing CUDA streams
Allocating chunk of mem for fixed working buffers - improves perf
Few bug-fixes

See buffer graph coloring for the optimized action graph of VGG-A-like schema to the right. You can get this and other interesting graphs by specifying "--debug_mode 1" option.

Nov 7, 2015

nnForge v2.0.0

Hi all,

6 months passed since last nnForge release and there is a good reason for it: I have been working on a major framework redesign, and now it is out! See nnForge v2.0.0:

The model is now arbitrary DAG (directed acyclic graph)
Running independent actions in mutiple streams in CUDA backend
Memory buffers are heavily reused

The changes are so radical, I had to drop support for the old trained data storage format. Unfortunately this means you will have to re-train your models from scratch.

Expect more goodies in near future!

Pages

Nov 24, 2015

nnForge v2.0.1

Nov 7, 2015

nnForge v2.0.0