视觉动态信息加工机制启发的非稳定视频光电小目标检测

罗柯; 杨欣; 李帅; 赵志坚; 王皓; 郭大庆; 王刚

doi:10.12466/xhcl.2026.05.011

视觉动态信息加工机制启发的非稳定视频光电小目标检测

Detection of Photoelectric Small Targets in Unstable Videos Inspired by the Visual Dynamic Information Processing Mechanism

摘要

摘要: 基于光电视频的复杂背景小目标检测识别是安全领域的关键技术。然而，受载体运动与外界扰动等因素影响，光电检测成像普遍存在非稳定晃动现象，导致背景时序一致性被破坏，小目标信噪比降低，进而影响检测性能。针对非稳定视频条件下光电小目标检测性能退化的问题，提出了一种受视觉动态信息加工机制启发的光电小目标检测架构。首先，设计了面向非稳定成像序列的预处理稳像方法，通过抑制全局背景干扰并增强小目标的时序响应特性，为后续特征提取提供更加稳定的输入信号；其次，借鉴生物视觉系统中大细胞通路的动态信息处理机理，构建运动响应特征图，并结合稳像后的视频帧，以强化小目标的运动显著性与区域判别能力。进而，将稳像后提取的时序运动信息与图像外观特征在卷积网络中联合建模，并作为重要补充信号引导深度检测网络；最后，搭建生物启发检测框架SMSOD（Stabilized Magnocellular inspired Small Object Detector），有效增强在复杂动态背景下红外小目标的感知能力。在真实光电小目标数据上进行对比实验验证，非稳定条件下的检测准确率为73.1%，经过稳像预处理后，检测精度提升至74.9%。在此基础上进一步融合运动信息，检测精度达到91.8%，较原始晃动测结果提升了18.7%。表明了所提出的方法能够有效抑制全局晃动带来的不利影响，为非稳定视频条件下的小目标检测提供了一种稳定且鲁棒的信号处理方法。

Abstract: Small target detection and recognition in complex backgrounds based on photoelectric video is a key technology in the field of security. However， due to factors such as carrier motion and external disturbances， photoelectric detection imaging often suffers from unstable jitter. This jitter disrupts the temporal consistency of the background， reduces the signal-to-noise ratio （SNR） of small targets， and degrades detection performance. To address the degradation of photoelectric small target detection performance under unstable video conditions， this study proposes a bio-inspired photoelectric small target detection architecture based on the dynamic information processing mechanism of biological vision. First， an image stabilization preprocessing method for unstable imaging sequences is designed. By suppressing global background interference and enhancing the temporal response characteristics of small targets， this method provides more stable input signals for subsequent feature extraction. Second， inspired by the dynamic information processing mechanism of the magnocellular pathway in the biological visual system， a motion response feature map is constructed. This map is combined with stabilized video frames to enhance the motion saliency and regional discriminability of small targets. Furthermore， the temporal motion information extracted after image stabilization and image appearance features are jointly modeled within a convolutional neural network. This modeling serves as an important supplementary signal to guide the deep detection network， thereby constructing the bio-inspired detection model named SMSOD. This model effectively enhances the perception capability of infrared small targets in complex dynamic backgrounds. Experimental verification is conducted on real photoelectric small target datasets. The detection accuracy under unstable conditions reaches 73.1%. After image stabilization preprocessing， the detection accuracy increases to 74.9%. With further fusion of motion information， the detection accuracy reaches 91.8%， representing an improvement of 18.7% compared with the detection results obtained from the original jittery frames. The results demonstrate that the proposed method can effectively suppress the adverse effects caused by global jitter， providing a stable and robust signal processing approach for small target detection under unstable video conditions.

HTML全文

参考文献(32)

施引文献

资源附件(0)