GpuConvOp thread block multiple of 32?
