基于多尺度特征残差学习卷积神经网络的视频超分辨率方法

林琦; 陈婧; 曾焕强; 朱建清; 蔡灿辉

doi:10.16798/j.issn.1003-0530.2020.01.007

基于多尺度特征残差学习卷积神经网络的视频超分辨率方法

Video Super-Resolution Method Based on Multi-Scale Characteristics Residual Learning Convolutional Neural Network

摘要

摘要: 本文提出了一种基于多尺度特征残差学习卷积神经网络的视频超分辨率方法，考虑到视频帧间的时空相关性，所提的方法采用由双三次插值预处理后的连续五帧视频作为卷积神经网络的输入，经由网络重建中间帧作为输出，依次按顺序重建直至获得整个高分辨率视频。本文所提出的卷积神经网络主要由多尺度特征提取、残差学习、亚像素卷积层、残差连接(skip-connection)四大部分组成，通过对视频的多尺度特征的提取获得更丰富的不同尺度特征和残差学习达到较好地恢复高频信息的目的。本文采用峰值信噪比（PSNR）和结构相似性指数（SSIM）作为损失函数优化网络。实验结果表明，本方法在平均评价指标上较其他方法均有一定的提升（PSNR +3.151dB，SSIM +0.102），从主观评价上看可以有效地减少视频边缘模糊的现象。

Abstract: A video super-resolution method based on multi-scale characteristics residual learning convolutional neural network is pro-posed in this paper. Considering the spatial-temporal relationships between frames of video, five adjacent frames of video is preprocessed by bicubic interpolation as the input of the proposed convolutional neural network; After the network, the re-constructed intermediate frame is output, reconstructed in order till the whole high-resolution video is obtained. The pro-posed convolutional neural network is composed of four parts, including multi-scale feature extraction, residual learning, sub-pixel convolutional layer, and skip-connection. With multi-scale characteristics extraction and the residual learning, the proposed method provides abundant features and restoring more high-frequency information. Peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) are used as the loss function to optimize the network. Experimental results have shown that compared with latest algorithms, the proposed method has a significant improvement on the average evalu-ation indicator (PSNR +3.151dB, SSIM +0.102) and effectively reduces the edge blurring phenomenon of the subjective visual effect.

HTML全文

参考文献(0)

施引文献

资源附件(0)