Transfer Discriminant Regression for Cross-domain Speech Emotion Recognition

SONG Peng; LI Shaokai; ZHANG Wenjing; ZHENG Wenming; ZHAO Li

doi:10.16798/j.issn.1003-0530.2023.04.006

SONG Peng, LI Shaokai, ZHANG Wenjing, ZHENG Wenming, ZHAO Li. Transfer Discriminant Regression for Cross-domain Speech Emotion Recognition[J]. JOURNAL OF SIGNAL PROCESSING, 2023, 39(4): 649-657. DOI: 10.16798/j.issn.1003-0530.2023.04.006

Citation:

Transfer Discriminant Regression for Cross-domain Speech Emotion Recognition

Graphical Abstract

Abstract

Abstract

‍ ‍To solve the problem that the training and testing data come from different domain databases in actual situation， which leads to the decline of recognition performance， we proposed a transfer discriminant regression method for cross-domain speech emotion recognition. Specifically， first， we employed maximum mean discrepancy （MMD） and graph Laplacian as the distance measurement between domains to reduce the distribution difference while preserving the local geometrical structure. Thus， we can learn a transferable common feature representation. To ensure that the information of target corpus is not lost in the process of knowledge transfer， an energy conservation strategy was proposed. Second， we trained a transferable regression model by using labeled source domain samples in the common subspace. We imposed an $L_{2,1}$ -norm constraint on the common projection matrix and regression term， which can make the model be more robust. The experimental results on three public datasets show that the proposed approach outperforms the other transfer learning methods.

FullText(HTML)

References (29)

Supplements (0)

Cited By

Transfer Discriminant Regression for Cross-domain Speech Emotion Recognition

Abstract

Catalog

Export File

Citation

Format

Content