融合快照集成与知识迁移的人体行为识别算法

Radar Human Behavior Recognition Algorithm Based on Snapshot Ensembles and Transfer Learning

  • 摘要: 用卷积网络进行人体行为识别是毫米波雷达的一个热门研究方向。由于卷积网络结构的缺陷性,而且目前用于人体行为识别公开的雷达领域数据样本量过少,传统深度学习算法对雷达微多普勒图像的识别率不高,且在训练过程中容易出现过拟合的现象。针对上述问题,本文提出一种融合快照集成与迁移学习的雷达人体行为识别算法。首先,针对深度卷积网络无法提取图像全局特征的问题,该算法通过搭建Vision Transformer(VIT)模型引入注意力机制。其次,通过VIT模型在公开自然数据集上进行任务迁移和特征空间的迁移,解决微多普勒图像的识别过拟合的问题。最后,利用基于快照集成的投票机制算法,提升模型对复杂雷达微多普勒图像的识别能力。试验结果表明,在目标任务数据集样本量少、背景复杂的情况下,该算法能在不增加训练成本的前提下提升微多普勒图像的识别准确率,在VIT模型下该算法识别准确率达到了89.25%,优于经典卷积神经网络。

     

    Abstract: ‍ ‍One of the most popular research fields of millimeter wave radar is to recognize human behavior by means of convolutional neural networks in deep learning. Due to the defect of convolutional neural networks’ structure, each layer of convolutional neural networks can only extract the relationship between the internal pixels of the receptive field, and lacks the ability to extract global features. Moreover, the number of public radar data samples used for human behavior recognition is too small at present, and the recognition accuracy of traditional depth learning algorithms for radar micro-Doppler images is not high, and the phenomenon of over fitting is easy to occur in the training process. To address these issues, a radar-based human behavior recognition algorithm based on Snapshot Ensembles and Transfer Learning, is proposed in this paper. First of all, in order to solve the problem that the traditional convolutional neural networks cannot extract the global features of images in the field of deep learning, this algorithm introduced attention mechanism by building a model named Vision Transformer (VIT). After that, the public natural image dataset named ImageNet, was used to perform task migration and feature space migration on the Vision Transformer model, which aimed to solve the problem of identification overfitting of radar micro-Doppler images. Last but not least, the voting mechanism algorithm based on Snapshot Ensembles was used in this paper to improve the recognition ability of the Vision Transformer model, though radar micro-Doppler images have complex background information. In a word, the experimental results show that, in case of too few samples capacity and complex background of target task data set, this algorithm can effectively improve the recognition accuracy of radar micro-Doppler images without increasing the training cost. Using this algorithm on Vision Transformer model, the recognition accuracy rate of the algorithm for radar micro-Doppler reached more than 89.25%, which is better than the typical convolutional neural networks.

     

/

返回文章
返回