改进的非负矩阵分解语音增强算法

Improved Nonnegative Matrix Factorization based Speech Enhancement Algorithm

  • 摘要: 本文提出了一种改进的非负矩阵分解语音增强算法,该算法可分为训练和增强两部分。首先,为了降低训练复杂度,采用卷积非负矩阵分解只提取噪声字典。增强时,考虑语音信号稀疏性比噪声信号稀疏性强,通过稀疏非负矩阵分解重构出语音幅度谱,采用交替方向乘子法进行优化迭代,克服了经典乘性迭代易陷入局部最优、分母只能收敛到零极限等问题。最后,基于算法融合的思想,将重构的语音幅度谱与谱减法、最小均方误差幅度谱估计得到的幅度谱进行加权融合。仿真实验中,在10种不同噪声环境中,通过多种评价标准证明所提算法能取得较好的增强效果。

     

    Abstract: This paper proposes an improved Nonnegative Matrix Factorization based speech enhancement algorithm. It mainly consists of a training stage and an enhancement stage. Firstly, to reduce the complexity in the training stage, only noise dictionary is trained using Convolution Nonnegative Matrix Factorization as the prior information. During the enhancement stage, considering the different sparse properties shown between noise and speech, the speech is separated from noise by using Sparse Nonnegative Matrix Factorization. Particularly, the update process is solved by putting the theory of Alternating Direction Method of Multipliers, which can solve the problems such as poor local optima and limitation of convergence to zero in the traditional speech enhancement using Nonnegative Matrix Factorization. Finally, to take advantage of the unsupervised speech enhancement algorithm, Spectral Subtraction and Minimum Mean Square Log-spectral based methods are combined to improve the performance. Extensive experiments indicate that the proposed algorithm shows promising noise suppression ability under ten various noise conditions.

     

/

返回文章
返回