非一致相似测度的图表示多观测样本分类算法

Multiple Observation Sets Classification Algorithm Based on Graphical  Presentation of Inconsistent Similarity Measure

  • 摘要: 多观测样本分类问题中,样本表示成流形上的点,针对如何利用多观测样本的流形结构提高其分类性能的问题,提出非一致相似测度的Graph表示多观测样本分类算法。首先综合数据的全局与局部结构特性,构造一个非一致相似测度,非一致相似测度主要考虑类内和类间差别,能有效地体现数据实际聚类的分布特性;其次构造非一致相似测度Graph,进而得到样本之间的相似度矩阵,然后通过一个格拉斯曼联合核把最佳投影的计算转化成寻找瑞利熵的最大特征向量问题,进而得到投影矩阵。最后将本征流形上的点投影到另一个流形上,使用最近邻分类器完成分类。在ETH-80物体识别数据库、CMU-PIE人脸数据库和BANCA数据库上进行对比实验,实验结果表明该方法优于传统方法。

     

    Abstract: In classification problem of multiple observation sets, the samples are represented as points on Grassmannian manifolds, with regard to how to exploit the manifold structure to improve the classification performance,a multiple observation sets classification algorithm based on inconsistent similarity measure graph is presented. First of all, considering the characters of global and local data structure comprehensively, an inconsistent similarity measure is constructed, which regards the distinction of within-class and between-class mainly and can effectively reflect the distribution character of actual data clustering. The second step is to obtain the similarity matrix via inconsistent similarity measure graph, after that, the computation of the optimal map is transformed into the search problem of the largest eigenvectors of the Rayleigh quotient by a combined Grassmannian kernel and then the projection matrix is obtained. Lastly, points on the manifold can be mapped into another space, the final classification is completed exploits the nearest neighbor classifier. Three comparative experiments are conducted on ETH-80 object recognition dataset, CMU-PIE and BANCA face recognition datasets, the results prove that the algorithm performs better than traditional algorithm.

     

/

返回文章
返回