CuDNN Now Accelerates Filter Groups!

With the latest cuDNN 7 release, a request of mine to the cuDNN team from just over a year ago has finally come to fruition - filter groups are now properly handled by the popular framework which provides accelerated code for common deep learning operations on Nvidia GPUs, according to the release notes:

Grouped Convolutions for models such as ResNeXt and Xception and CTC (Connectionist Temporal Classification) loss layer for temporal classification.

For more information, see the CuDNN release notes. Thanks to Michael Figurnov for pointing this out to me!

Leave a Comment

Your email address will not be published. Required fields are marked *

Loading...