TIAN Xiang, ZHANG Liang. Improved R-C3D temporal action detection network[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(3): 447-455. DOI: 10.16798/j.issn.1003-0530.2021.03.015
Citation: TIAN Xiang, ZHANG Liang. Improved R-C3D temporal action detection network[J]. JOURNAL OF SIGNAL PROCESSING, 2021, 37(3): 447-455. DOI: 10.16798/j.issn.1003-0530.2021.03.015

Improved R-C3D temporal action detection network

  • In order to improve the classification accuracy and the temporal positioning accuracy of the temporal action detection network, this paper proposes an improved Region Convolutional 3D neural network(R-C3D).In the temporal proposal subnet, the height and width of the feature map is changed from (H/16, W/16) to (1, 1) through layer-by-layer spatial convolution.So the classification accuracy is improved.According to the idea of deconvolution in Convolutional-De-Convolutional Networks (CDC), the time domain deconvolution network is used to increase the length of the feature map and improve the temporal positioning accuracy of the behavior.The experimental results on the THUMOS14 data set show that the proposed method has better detection accuracy than R-C3D on long-time unsegmented videos.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return