基于仿射变换的胶囊网络特征研究

Study on Characteristics of Capsule Network based on Affine Transformation

  • 摘要: 针对卷积神经网络(CNN)特征表达的空间信息缺陷问题,深度学习的创始人之一G. E. Hinton提出胶囊网络(Capsule Network)模型。相比于CNN网络特征,该模型的特征呈矢量化,能描述部分到整体之间的位置变换信息。为了更好的理解基本仿射变换下的胶囊网络的特征,本文借助Tensorflow中集成的仿真工具Tensorboard,从可视化的角度来研究平移、旋转等基础仿射变换下的胶囊网络特征。不仅如此,本文为平移、旋转过程中的实验结果准确性提供了评价指标。实验结果表明:胶囊网络的确可以通过内部集成的胶囊模块去学习和输出包括视觉实体正确的姿态、形变等信息在内的“实例化参数”,胶囊网络(Capsule Network)对于位置、方向变化的处理比传统技术方法更直观。

     

    Abstract: In view of the problem of spatial information defects expressed in Convolution Neural Network (CNN), G. E. Hinton, one of the founders of deep learning, proposed the Capsule Network Model. Compared with that of CNNs, the feature of the Capsule model takes the form of vectors, which convey the location transformation information from part to whole. In order to better understand the characteristics of the capsule network under the basic affine transformation, this paper uses Tensorboard that is an integrated simulation tool in Tensorflow, from the perspective of visualization, to study the characteristics of the capsule network under the basic affine transformation, such as translation and rotation. In addition, this paper provides an evaluation index for the accuracy of experimental results during translation and rotation. The experimental results show that the Capsule Network can actually learn and output "instantiation parameters" including the correct attitude and deformation information of the visual entity through the internal integrated capsule module. The Capsule Network is more intuitive to deal with the position and direction changes than the traditional neural networks.

     

/

返回文章
返回