基于类别自适应动态阈值与参数迁移学习的小样本声呐图像分类方法

Few-Shot Sonar Image Classification Method Based on Class-Adaptive Dynamic Threshold and Parameter Transfer Learning

  • 摘要: 声呐图像分类是水下目标探测与识别的关键技术,广泛应用于海洋资源勘察、生物监测等领域。由于海洋环境背景复杂,声呐图像分辨率低、噪声严重,且大量数据因无法实时标注而成为无标签数据。此外,由于声呐设备成本高、技术高度专业化,开源且高质量的声呐图像数据集极为稀缺,且不同水下目标类别在数据集中的样本数量差异较大,难以直接采用机器学习进行识别分类。论文提出一种基于类别自适应动态阈值与参数迁移学习的小样本声呐图像分类方法,以解决声呐图像数据不足、类别不平衡的问题。首先,利用参数迁移学习思想,在大规模ImageNet图像数据集上预训练轻量化AlexNet模型,并利用带标签的声呐图像数据集进行微调,以解决声呐图像小样本问题。其次,设计一种类别自适应动态阈值机制,根据不同水下目标类别分布情况动态调整识别声呐图像的置信度阈值,以筛选合适的伪标签数据、缓解类不平衡带来的训练偏差问题。最后,将筛选后的声呐图像伪标签数据与已有标签数据融合,构建新的训练集进行半监督重训练,并通过合理设置优化器参数进一步提升模型的鲁棒性和泛化能力。仿真结果表明,在真实声呐图像数据集上,所提方法在仅使用21%标签数据情况下的整体分类准确率可达到96.15%,平均F1分数可达到93.48%,验证了所提方法在小样本和数据类别不平衡的条件下,仍能实现较优的分类效果。

     

    Abstract: Sonar image classification is a key technology for underwater target detection and recognition, and it is widely applied in fields such as marine resource exploration and biological monitoring. Owing to the complex background of the marine environment, the low resolution and severe noise of sonar images, and the large amount of data that remain unlabeled because they cannot be labeled in real time, challenges arise. In addition, owing to the high cost of sonar equipment and high degree of technical specialization, open-source and high-quality sonar image datasets are extremely scarce, and the number of samples of different underwater target categories in the dataset varies greatly, making it difficult to directly use machine learning for identification and classification. In this paper, we propose a few-shot sonar image classification method based on category-adaptive dynamic thresholds and parameter transfer learning to solve the problems of insufficient sonar image data and category imbalance and to address the difficulty for traditional deep learning models to achieve good generalization. First, leveraging the concept of parameter transfer learning, a lightweight AlexNet model was pre-trained on the large-scale ImageNet image dataset. Then, this model was fine-tuned using a labeled sonar image dataset to solve the few-shot problem of sonar images. Second, a category-adaptive dynamic threshold mechanism was designed. According to the distribution of different underwater target categories, the confidence threshold for recognizing sonar images is dynamically adjusted to screen appropriate pseudo-labeled data and alleviate the training bias problem caused by class imbalance. Finally, the screened pseudo-labeled sonar data were fused with existing labeled data to construct a new training set for semi-supervised retraining. By reasonably setting the optimizer parameters, the robustness and generalization ability of the model were further improved. The simulation results show that on the real sonar image dataset, the proposed method can achieve an overall classification accuracy of 96.15% and an average F1 score of 93.48% when using only 21% of the labeled data. This confirms that the proposed method can achieve excellent classification performance under conditions of few-shot data and data category imbalance.

     

/

返回文章
返回