基于YOLOv3-SPP的遥感图像目标检测压缩模型
Compression Model for Remote Sensing Image Target Detection Based on YOLOv3-SPP
-
摘要: 近年来,卷积神经网络模型已被广泛应用于遥感图像目标检测任务中,但自然场景图像与遥感图像的目标特性存在差异,针对自然场景设计的模型往往难以在遥感图像任务中取得良好的效果。同时,很多遥感图像处理任务需要在星载、机载等资源有限的平台中进行,难以部署参数量、计算量大的复杂模型。针对以上问题,本文对在自然场景中性能优异的YOLOv3-SPP模型进行适应性改进及参数压缩。首先,对原始的L1范数剪枝算法进行改进,提出基于L1范数和均值差的加权剪枝算法,能够更好地保留重要的通道。其次,对剪枝后的子网集合进行快速评估,选取评估结果最好的子网进行微调。在预训练和微调阶段,本文将SPP模块中的最大池化层替换为softmax加权池化层,着重突出深层网络中权重较大的特征,提高了模型的检测精度。本文在多个公开遥感数据集上进行实验,结果表明改进的YOLOv3-SPP模型在遥感目标检测任务上具有更好的性能,同时本文的剪枝算法可以在相同的参数压缩比例条件下,降低模型的性能损失。Abstract: In recent years, convolutional neural network (CNN) models have been widely applied in remote sensing image object detection tasks. However, the target characteristics of natural scene images and remote sensing images differ, and models designed for natural scenes often perform poorly in remote sensing image tasks. Additionally, many remote sensing image processing tasks need to be carried out on resource-limited platforms such as satellites and aircraft, making it difficult to deploy complex models with large parameter and computational requirements. To address these issues, this paper proposes adaptive improvements and parameter compression for the YOLOv3-SPP model, which has shown outstanding performance in natural scenes. First, an improved weighted pruning algorithm based on L1 norm and mean difference is proposed to better preserve important channels. Next, the pruned subnet collection is quickly evaluated, and the best-performing subnet is fine-tuned. In the pre-training and fine-tuning stages, the maximum pooling layer in the SPP module is replaced with a softmax weighted pooling layer to highlight features with larger weights in the deep network and improve the detection accuracy of the model.Experiments conducted on multiple public remote sensing datasets show that the improved YOLOv3-SPP model has better performance in remote sensing object detection tasks. Additionally, the pruning algorithm in this paper can reduce the model’s performance loss under the same parameter compression ratio.