[1] |
J. Li, S. Sakamoto, S. Hongo. Adaptive β-order generalized spectral subtraction for speech enhancement [J]. Signal Processing, 2008, 88(11): 2764-2776.
|
[2] |
A. Borowicz, A. Petrovsky. Signal subspace approach for psychoacoustically motivated speech enhancement [J]. Speech Communication, 2011, 53(2): 210-219.
|
[3] |
J. Chen, J. Benesty, Y. Huang. New insights into the noise reduction Wiener filter [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2006, 14(4): 1218-1234.
|
[4] |
Y. Ephraim, D. Malah. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984, 32(6): 1109-1121.
|
[5] |
M. Djendi, P. Scalart. Reducing over- and under-estimation of the a priori SNR in speech enhancement techniques [J]. Digital Signal Processing, 2014, 32: 124-136.
|
[6] |
R. Martin. Noise power spectral density estimation based on optimal smoothing and minimum statistics [J]. IEEE Transactions on Speech and Audio Processing, 2001, 9(5): 504-512.
|
[7] |
Y. S. Park, J. H. Chang. A probabilistic combination method of minimum statistics and soft decision for robust noise power estimation in speech enhancement [J]. IEEE Signal Processing Letters, 2008, 15(1): 95-98.
|
[8] |
I. Cohen. Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging [J]. IEEE Transactions on Speech and Audio Processing, 2003, 11(5): 466-475.
|
[9] |
T. Inoue, H. Saruwatari, Y. Takahashi. Theoretical analysis of musical noise in generalized spectral subtraction based on higher order statistics [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2011, 19(6): 1770-1779.
|
[10] |
P. C. Loizou. Speech enhancement: theory and practice [M]. Boca Raton, FL: CRC Press, 2007.
|
[11] |
F. Liao, P. Li, B. Xu. Dual-channel voice activity detection in music noise environments [J]. Journal of Signal Processing, 2009, 25(11): 1820-1824. (in Chinese)
|
[12] |
X. Zhang, D. Wang. Boosting contextual information for deep neural network based voice activity detection [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2016, 24(2): 252-264.
|
[13] |
X. Zhang, J. Wu. Deep belief networks based voice activity detection [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2013, 21(4): 697-710.
|
[14] |
A. Abramson, I. Cohen. Simultaneous detection and estimation approach for speech enhancement [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(8): 2348-2359.
|
[15] |
A. Papoulis, U. Pillai. Probability, random variables and stochastic processes [M]. McGraw-Hill, 2011.
|
[16] |
S. R. Quackenbush, T. P. Barnwell, M. A. Clements. Objective measures of speech quality [M]. Prentice Hall, 1988.
|
[17] |
Y. Hu, P. C. Loizou. Evaluation of objective quality measures for speech enhancement [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2008, 16(1): 229-238.
|