XU Feng, LI Ping. Research on Audio Super-resolution Method Based on FFTNet-GAN[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(1): 59-65. DOI: 10.16798/j.issn.1003-0530.2021.01.007
Citation: XU Feng, LI Ping. Research on Audio Super-resolution Method Based on FFTNet-GAN[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(1): 59-65. DOI: 10.16798/j.issn.1003-0530.2021.01.007

Research on Audio Super-resolution Method Based on FFTNet-GAN

  • This paper proposes a generative adversarial network model based on FFTNet to achieve extreme audio super-resolution tasks. The generator uses parallel, non-causal, and non-local three-way split-sum FFTNet. This shallow model is fast and accurate. It can better extract the long-term correlation structure of time-domain audio and extract features at the desired resolution, can help improve reconstruction performance.In addition, a discriminator with matching performance is designed to stably adapt to the generation adversarial architecture. Fusion based on the frequency domain perceptual loss, fixed weight with sample space loss to reduce reconstruction distortion and improve perceptual quality. From the subjective and objective system evaluation, the method in this paper is better than the baseline model. Judging from the 2x/4x/6x times reduction effect, the model has extreme high-frequency reconstruction ability, which helps to improve the time resolution of the audio signal.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return