PCA联合子空间理论的规范化与扩展

Theoretical Normalization and Generalization of PCA Joint Subspace Model

  • 摘要: 对于高维数据的分类,主成分分析(PCA)联合子空间可为各类数据建立更为细致的概率模型,从而提高贝叶斯分类的准确性。本文首先对PCA联合子空间理论进行了规范化,提出了两个基本假设,并从理论上证明了残差子空间参数“代表特征根”的启发式取值正是其极大似然估计。本文进一步对样本残差的概率模型进行了扩展,提出了扩展型逐类联合子空间算法。最后,本文通过在真实数据上实验结果证明了扩展型逐类联合子空间算法的优越性。

     

    Abstract: For the classification of high-dimensional data, PCA joint subspace model can accurately describe the probability distribution of the sample data of one class and thus improve the classification accuracy of the corresponding Bayesian classifier. In this paper, we firstly make certain theoretical normalization of the PCA joint subspace. Particularly, its two basic assumptions are proposed. Moreover, it is proved that the used heuristic value of the parameter referred to as “representative eigenvalue” for the residual subspace is just its maximum likelihood estimate. We further generalize the expression of the probability distribution of the residual subspace and establish the generalized class-wise joint subspace algorithm for Bayesian classification. Finally, the experimental results on several real-world datasets demonstrate the superiority of the generalized class-wise joint subspace algorithm.

     

/

返回文章
返回