LI Xiao-ming, BAO Chang-chun. A Unified Speech and Audio Coding with Empirical Model Decomposition[J]. JOURNAL OF SIGNAL PROCESSING, 2013, 29(10): 1274-1282.
Citation: LI Xiao-ming, BAO Chang-chun. A Unified Speech and Audio Coding with Empirical Model Decomposition[J]. JOURNAL OF SIGNAL PROCESSING, 2013, 29(10): 1274-1282.

A Unified Speech and Audio Coding with Empirical Model Decomposition

Funds: National Natural Science Foundation of China (61072089,61201197);Beijing Natural Science Foundation Program and Scientific Research Key Program of Beijing Municipal Commission of Education (KZ201110005005)
More Information
  • Received Date: July 08, 2013
  • Revised Date: August 17, 2013
  • Published Date: October 24, 2013
  • In this paper, a unified speech and audio coding method that based on Empirical Mode Decomposition (EMD) by exploiting the harmonic structure of input signal was proposed. This coder can achieve a high performance for both speech and audio signals at low and medium bitrates, which cannot be done by the codec with one single analysis model. Prior to the quantization, the EMD was adopted to extract the harmonic components of the input signal, after this, the extracted harmonic signal was modeled and quantized by sinusoidal model and perceptual weighted matching pursuit. For the quantization residual of harmonic signal, the dithered lattice vector quantization was used to improve the subjective quality. Finally, both the objective PESQ/PEAQ results and subjective A/B listening tests show that the proposed coder outperforms the ITU-T G.722.1 and G.722.2 codec.
  • Related Articles

    [1]WANG Chao, YU Yibiao. Vector Quantization Regularized Variational Autoencoders For Non-parallel Voice Conversion[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(7): 1339-1345. DOI: 10.16798/j.issn.1003-0530.2021.07.025
    [2]Chen Sijia, Zhao Zhijin, Zhang Xiaofei. The Quantization Kernel Least Mean P-norm Algorithm Using Transfer Learning[J]. JOURNAL OF SIGNAL PROCESSING, 2019, 35(8): 1366-1375. DOI: 10.16798/j.issn.1003-0530.2019.08.011
    [3]ZHOU Wei-Dong, YANG Zhen, YU Yun. speech enhancement by using modified orthogonal matching pursuit algorithm[J]. JOURNAL OF SIGNAL PROCESSING, 2016, 32(3): 287-295. DOI: 10.16798/j.issn.1003-0530.2016.03.005
    [4]HUANG Zhen, BAI Zheng-Yao, MO Yu-Jun. Spectrum Sensing using the algorithm of Compressive Sampling Matching Pursuit[J]. JOURNAL OF SIGNAL PROCESSING, 2014, 30(9): 1086-1090.
    [5]LIU Bin, TAO Jian-hua, MO Fu-yuan. Research on Speech Coding Algorithm at Very Low Bit Rate for Narrow-Band Communication[J]. JOURNAL OF SIGNAL PROCESSING, 2013, 29(9): 1134-1141.
    [6]DENG Xiao-Man, PAN Zhi-Bin, GAO Feng-Juan. Improved high efficiency information embedding scheme based on modified fast correlation vector quantization[J]. JOURNAL OF SIGNAL PROCESSING, 2012, 28(8): 1139-1147.
    [7]ZHANG Xiao-Wei, LI Ming, ZUO Lei. Sparse Signal Recovery Based on Stepwise Compressed Sampling Matching Pursuit[J]. JOURNAL OF SIGNAL PROCESSING, 2012, 28(6): 886-893.
    [8]WANG Fa-Song, ZHANG Lin-Rang, ZHOU Yu. Multiple Measurement Vectors for Compressed Sensing: Model and Algorithms Analysis[J]. JOURNAL OF SIGNAL PROCESSING, 2012, 28(6): 785-792.
    [9]ZHU Yan-Wan, ZHAO Yong-Jun, SUN Bing. A Modified Sparsity Adaptive Matching Pursuit Algorithm[J]. JOURNAL OF SIGNAL PROCESSING, 2012, 28(1): 80-86.
    [10]XIAO Qiang, CHEN Liang, ZHU Tao, HUANG Jian-Jun. Dimension Reduction Quantization of LSP Parameters Based on Compressed Sensing[J]. JOURNAL OF SIGNAL PROCESSING, 2011, 27(4): 563-568.

Catalog

    Article Metrics

    Article views (478) PDF downloads (1243) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return