YANG Yang, GUO Haiyan, WANG Tingting, ZHANG Pengcheng, YANG Zhen. Multichannel Speech MVDR Enhancement Algorithm Based on Joint Spatial-temporal Graph Topology[J]. JOURNAL OF SIGNAL PROCESSING, 2023, 39(3): 540-549. DOI: 10.16798/j.issn.1003-0530.2023.03.016
Citation: YANG Yang, GUO Haiyan, WANG Tingting, ZHANG Pengcheng, YANG Zhen. Multichannel Speech MVDR Enhancement Algorithm Based on Joint Spatial-temporal Graph Topology[J]. JOURNAL OF SIGNAL PROCESSING, 2023, 39(3): 540-549. DOI: 10.16798/j.issn.1003-0530.2023.03.016

Multichannel Speech MVDR Enhancement Algorithm Based on Joint Spatial-temporal Graph Topology

  • ‍ ‍In this paper, multichannel speech enhancement in the graph frequency domain is investigated, and a joint graph topology in the spatial-temporal dimension is constructed using graph signal processing (GSP) theory, based on which enhancement algorithms are designed for multichannel speech denoising. Specifically, a temporal graph topology is constructed based on the temporal correlation of speech vertex signals between the input frames of a microphone of the input array; Meanwhile, a spatial graph topology based on the spatial correlation of received signals in each channel is built for multi-channel noisy speech. Based on a joint graph topology composed of temporal and spatial bipartite graph topologies, a joint graph topology-based minimum variance distortionless response (MVDR) enhancement algorithm in the graph frequency domain is used to perform multichannel speech enhancement. Numerical simulation results show that the proposed joint graph topology-based MVDR (JG-MVDR) beamforming method outperforms both the regular graph-based MVDR (GMVDR) beamforming method and the complex Gaussian mixture model based MVDR (CGMM-MVDR) beamforming method in terms of the average perceptual evaluation of speech quality (PESQ) and the average extended short-time objective intelligibility (ESTOI).
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return