在线更新噪声基矩阵的非负矩阵分解语音增强方法

Online Update of Noise Basis Matrix for the NMF-based Speech Enhancement

  • 摘要: 基于非负矩阵分解(Nonnegative matrix factorization, NMF)的语音增强算法需要和背景噪声类型匹配的噪声基矩阵(Basis matrix),而在实际中,这是很难被保证的。本文提出了一种基于噪声基矩阵在线更新的非负矩阵分解语音增强方法,该方法首先利用一个无语音帧判决模块识别出带噪语音的无语音区域,然后利用一个固定长度的滑动窗口(Sliding window)来包含若干帧最近过去的带噪语音的无语音帧,并用这些无语音帧的幅度谱在线更新噪声基矩阵,最后利用更新得到的噪声基矩阵和预先训练的语音基矩阵实现语音增强。该方法能够在线更新出匹配的噪声基矩阵,有效地解决了噪声基矩阵不匹配的问题。实验证明,本文所提的方法在线学习到的噪声基矩阵在大多数条件下比匹配训练集下训练得到的噪声基矩阵的性能还要优越。

     

    Abstract: In nonnegative matrix factorization (NMF)-based speech enhancement, the matched noise basis matrix needs to be trained, which is difficult to be guaranteed in practice. In this paper, an NMF-based speech enhancement method is proposed in which the noise basis matrix is updated online. First, the non-speech regions of noisy signal are determined by utilizing a decision module of non-speech frame. Then, a fixed-length sliding window is used to cover several recent past frames of noisy speech determined as non-speech, and the magnitude spectrums of these non-speech frames are used to update the noise basis matrix online. After that, the updated noise basis matrix and the pre-trained speech basis matrix are used to achieve speech enhancement. This method can obtain the matched noise basis matrix online and effectively solve the problem of the mismatch of the noise basis matrix. The test results demonstrate that the noise basis matrix trained online by the proposed method performs better than that trained from the matched dataset in most conditions.

     

/

返回文章
返回