SAR图像去斑的频率驱动异质模型
A Frequency-Driven Heterogeneous Model for SAR Image Despeckling
-
摘要: 现有合成孔径雷达(Synthetic Aperture Radar, SAR)图像去斑模型大多来源于相关通用视觉领域,普遍缺乏对散斑噪声空间异质分布特性的处理能力。本文提出一个用于SAR图像去斑的频率驱动异质模型,通过池化层显式分离高频和低频特征,并依据不同噪声分布特性分别处理。首先,设计了一种深度卷积交叉协方差注意力,能够对高频特征进行联合空间-通道全局建模,增强边缘关键信息表示,去除复杂散斑效应;随后,引入一种大核卷积残差模块,利用感受野优势有效抑制低频特征中的简单散斑影响;最后,为强化频率特征间的互补性交互,进一步提升去斑性能,提出一种选择性频率融合模块,能够通过少量参数自适应融合经异质处理后的高频和低频特征。在模拟和真实SAR图像数据集上的实验结果表明,本文模型在量化指标和视觉效果上均优于多种现有的SAR图像去斑方法。相较于次优方法,平均峰值信噪比(Peak Signal-to-Noise Ratio, PSNR)提高约0.14 dB,平均结构相似度(Structure Similarity Index Measure, SSIM)提高约0.01,而推理时间减少99%以上。相较于经典的轻量化方法,平均PSNR提高0.78~1.12 dB,平均SSIM提高0.02~0.06。Abstract: Most of the existing Synthetic Aperture Radar (SAR) image despeckling models have been derived from related generic vision fields and generally lack the capability to handle the spatially heterogeneous distribution characteristics of speckle noise. This paper proposes a Frequency-Driven Heterogeneous Model for SAR image despeckling, which explicitly separates high- and low-frequency features using pooling layers and processes them separately based on their distinct noise-distribution characteristics. First, a Dconv Cross-Covariance Attention mechanism was designed to model the spatial-channel global characteristics of high-frequency features, enhancing the representation of critical edge information and removing complex speckle effects. Then, a Large Kernel Convolutional Residual Module was introduced to effectively suppress the simple speckle interference in low-frequency features by leveraging the receptive field advantage. Finally, to strengthen the complementary interaction between frequency features and further improve the despeckling performance, a Selective Frequency Fusion Module was used to adaptively fuse the heterogeneously processed high- and low-frequency features with minimal parameters. The results of experiments on both simulated and real SAR image datasets demonstrated that the proposed model outperformed some existing despeckling methods in both quantitative metrics and visual quality. Compared to the suboptimal method, it achieved average PSNR and SSIM increases of 0.14 dB and 0.01, respectively, while the inference time was reduced by at least 99%. Compared to classical lightweight methods, it achieved average PSNR and SSIM increases of 0.78~1.12 dB and 0.02~0.06, respectively.
下载: