Mixed-Precision Network Quantization for Infrared Small Target Segmentation
Citations Over TimeTop 1% of 2024 papers
Abstract
Network quantization is leveraged to reduce the model size, memory footprint, and computational cost of deep neural networks. It is achieved by representing float weights and activations with lower bit counterparts, which is essential for model deployment on resource-limited devices. However, due to the extremely small size of infrared small targets in the feature map, low-bit quantization could lead to huge information loss of small targets and thus causes severe segmentation performance degradation. To achieve low-bit quantization while maintaining the segmentation performance, we first study the quantization sensitivity of small target segmentation network and observe the sensitivity heterogeneity of different layers in the network. Specifically, feature maps in shallow layers and encoder subnetwork are more vulnerable to information loss caused by quantization as compared to deep layers and decoder subnetwork. Based on these observations, we are motivated to assign a different bitwidth for each block according to their quantization sensitivity. A simple yet effective symmetrically progressive decreasing mixed-precision quantization (SPMix-Q) method is proposed to achieve high-performance segmentation under low-bit quantization (i.e., 2.42 bits for weights and 3.82 bits for activations). The experimental results show that our SPMix-Q achieves comparable accuracy with only 1/13 model size, 1/4.6 memory footprint, and 1/29 computational cost to the full-precision counterparts. Compared with the homogeneous low-bit quantization methods, our method achieves much better performance in terms of intersection of union (IoU) on the benchmark datasets. Our mobile-system-on-a-chip (SOC) (e.g., Kyrin 980, Snapdragon 660, and Dimensity 800U) deployable android application package (APK) is available at: https://github.com/YeRen123455/SIRST-Quantization-Deployment .
Related Papers
- → Design and performance of a multiwavelength airborne polarimetric lidar for vegetation remote sensing(2004)68 cited
- → Radiometric Calibration of the Landsat MSS Sensor Series(2011)43 cited
- → Interactive Segmentation for COVID-19 Infection Quantification on Longitudinal CT scans(2021)
- → Conditional Matting For Post-Segmentation Refinement Segment Anything Model(2023)
- Investigation of Ground Remote Sensing Technique Using CCD Camera(2006)