Single-channel Speech Enhancement Algorithm Combining Deep Convolutional Recurrent Neural Network And Time-frequency Attention Mechanism

Yan Zhaoyu; Wang Jing

doi:10.16798/j.issn.1003-0530.2020.06.007

Yan Zhaoyu, Wang Jing. Single-channel Speech Enhancement Algorithm Combining Deep Convolutional Recurrent Neural Network And Time-frequency Attention Mechanism[J]. JOURNAL OF SIGNAL PROCESSING, 2020, 36(6): 863-870. DOI: 10.16798/j.issn.1003-0530.2020.06.007

Citation:

Single-channel Speech Enhancement Algorithm Combining Deep Convolutional Recurrent Neural Network And Time-frequency Attention Mechanism

Graphical Abstract

Abstract

Abstract

The purpose of speech enhancement is to separate clean speech signal from speech mixed with additional noise, improve speech quality and speech intelligibility. In recent years, supervised deep learning neural networks have been a popular method of speech enhancement. Convolutional recurrent neural network is a novel network structure including encoder, middle layer and decoder. Time-frequency attention mechanism is a simple network module composed of several convolutional layers with skip connections. In training process, it can compute the non-local correlations of speech magnitude. In this paper, we applied T-F attention module into a convolutional neural network. The experimental results show in different signal-to-noise ratio conditions, the proposed method can further improve speech quality and intelligibility, and maintain more speech harmonic information, and achieve lower speech distortion.

FullText(HTML)

References (0)

Supplements (0)

Cited By

Single-channel Speech Enhancement Algorithm Combining Deep Convolutional Recurrent Neural Network And Time-frequency Attention Mechanism

Abstract

Catalog

Export File

Citation

Format

Content