Speed test and integrate Alex's convolution code on GPU
