Research on Audio Super-resolution Method Based on FFTNet-GAN

XU Feng; LI Ping

doi:10.16798/j.issn.1003-0530.2021.01.007

XU Feng, LI Ping. Research on Audio Super-resolution Method Based on FFTNet-GAN[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(1): 59-65. DOI: 10.16798/j.issn.1003-0530.2021.01.007

Citation:

XU Feng, LI Ping. Research on Audio Super-resolution Method Based on FFTNet-GAN[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(1): 59-65. DOI: 10.16798/j.issn.1003-0530.2021.01.007

Citation:

XU Feng, LI Ping. Research on Audio Super-resolution Method Based on FFTNet-GAN[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(1): 59-65. DOI: 10.16798/j.issn.1003-0530.2021.01.007

Research on Audio Super-resolution Method Based on FFTNet-GAN

XU Feng,
LI Ping

Graphical Abstract

Abstract

Abstract

This paper proposes a generative adversarial network model based on FFTNet to achieve extreme audio super-resolution tasks. The generator uses parallel, non-causal, and non-local three-way split-sum FFTNet. This shallow model is fast and accurate. It can better extract the long-term correlation structure of time-domain audio and extract features at the desired resolution, can help improve reconstruction performance.In addition, a discriminator with matching performance is designed to stably adapt to the generation adversarial architecture. Fusion based on the frequency domain perceptual loss, fixed weight with sample space loss to reduce reconstruction distortion and improve perceptual quality. From the subjective and objective system evaluation, the method in this paper is better than the baseline model. Judging from the 2x/4x/6x times reduction effect, the model has extreme high-frequency reconstruction ability, which helps to improve the time resolution of the audio signal.

FullText(HTML)

References (27)

Supplements (0)

Cited By

Research on Audio Super-resolution Method Based on FFTNet-GAN

Abstract

Catalog

Export File

Citation

Format

Content