Pruning Deep Convolutional Neural Network Using Conditional Mutual Information
Vu-Van, Tien, Thanh, Dat Du, Ho, Nguyen, Vu, Mai
–arXiv.org Artificial Intelligence
Convolutional Neural Networks (CNNs) achieve high performance in image classification tasks but are challenging to deploy on resource-limited hardware due to their large model sizes. To address this issue, we leverage Mutual Information, a metric that provides valuable insights into how deep learning models retain and process information through measuring the shared information between input features or output labels and network layers. In this study, we propose a structured filter-pruning approach for CNNs that identifies and selectively retains the most informative features in each layer. Our approach successively evaluates each layer by ranking the importance of its feature maps based on Conditional Mutual Information (CMI) values, computed using a matrix-based Rényi α-order entropy numerical method. We propose several formulations of CMI to capture correlation among features across different layers. We then develop various strategies to determine the cutoff point for CMI values to prune unimportant features. This approach allows parallel pruning in both forward and backward directions and significantly reduces model size while preserving accuracy. Tested on the VGG16 architecture with the CIFAR-10 dataset, the proposed method reduces the number of filters by more than a third, with only a 0.32% drop in test accuracy. Convolution Neural Network (CNN) has achieved remarkable success in various tasks such as image classification, object detection, and segmentation (Zhang et al., 2019), (Li et al., 2021). Deeper architectures such as VGG16 (Simonyan & Zisserman, 2014) and ResNet (He et al., 2016) have shown superior performance in handling complex image classification tasks. However, the effectiveness of these networks is often reliant on very deep and wide architectures, resulting in a very large number of parameters that lead to longer training and inference time, and create challenges when deploying them on resource-constrained devices (Blalock et al., 2020), (Yang et al., 2017).
arXiv.org Artificial Intelligence
Nov-27-2024
- Country:
- Asia > Vietnam
- Hồ Chí Minh City > Hồ Chí Minh City (0.04)
- North America > United States
- Maryland (0.04)
- Asia > Vietnam
- Genre:
- Research Report > New Finding (0.88)
- Technology: