基于频率信噪比加权的麦克风阵列声源定位算法

Sound source localization using SNR-based frequency weighting with microphone array

  • 摘要: 为了提高噪声和混响环境下麦克风阵列的声源定位算法性能,提出了一种基于频率信噪比加权的可控响应功率定位算法。该算法首先根据每帧阵列信号的频域协方差矩阵估计每个频率的信噪比;然后通过激活函数将频率信噪比映射为加权值,并修正传统的相位变换可控响应功率计算公式;最后利用修正公式计算每个候选位置的可控响应功率值,通过搜索可控响应功率的最大值实现声源定位。该算法根据实时估计的频率信噪比自适应地调整各频率分量对可控响应功率的贡献。仿真结果表明,与传统的相位变换可控响应功率算法、维纳预滤波波束形成算法相比,在噪声和混响的复杂声学环境下,本文算法的定位正确率更高,均方根误差更小,对噪声的鲁棒性更强。

     

    Abstract: In order to improve the performance of sound source localization with microphone array in reverberant noisy environments, a frequency weighted sound source localization algorithm based on signal-to-noise ratio (SNR) is proposed. First, SNR of each frequency is estimated from the covariance matrix of the array signals in each frame. Then, the SRP-PHAT (steered response power-phase transform) formula is revised by the weight mapped from the frequency SNR by activation function. Finally, the SRP of each candidate location is calculated by the revised formula, and the sound source location is estimated by searching the maximum value of all the SRPs. The proposed method adaptively adjusts the contribution of frequency component to SRP according to the frequency SNR estimated in real time. Simulation results show that, compared with the conventional SRP-PHAT algorithm and Wiener pre-filtering beamformer algorithm, the proposed method obtains a higher percentage of correct estimates and a lower root mean square error, and is more robust against noise in complex noisy and reverberation environments.

     

/

返回文章
返回