Abstract: The optimization of convolutional neural networks (CNN) generally refers to the improvement of the inference process, making it as fast and precise as possible. While inference time is an ...