Query-by-example spoken term detection by applying the HDPHMM tokenizer

CAO Jian-kai; ZHANG Lian-hai

doi:10.16798/j.issn.1003-0530.2017.05.007

CAO Jian-kai, ZHANG Lian-hai. Query-by-example spoken term detection by applying the HDPHMM tokenizer[J]. JOURNAL OF SIGNAL PROCESSING, 2017, 33(5): 703-710. DOI: 10.16798/j.issn.1003-0530.2017.05.007

Abstract

This paper studies a hierarchical Dirichlet process hidden Markov model (HDPHMM) approach to unsupervised query-by-example spoken term detection (QbE-STD). First, a hierarchical hidden Markov model is applied, in which the top-layer states represent the acoustic units to be discovered and the bottom-layer states model the emission probabilities of the top-layer states. Imposing a hierarchical Dirichlet process prior on the top-layer states yields a nonparametric Bayesian model, the HDPHMM. After the model is trained on unlabeled speech data, it outputs posteriorgram feature vectors for the test utterances and the query term. The posteriorgram features are then optimized with a non-negative matrix factorization algorithm, and detection is performed with a modified SDTW algorithm. Experimental results show that the proposed method outperforms a baseline system based on a Gaussian mixture model tokenizer and markedly improves detection precision.
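The detection step can be illustrated with a minimal sketch. The abstract does not say how the SDTW algorithm is modified or which frame distance is used, so the code below is an assumption rather than the authors' method: it uses plain subsequence DTW with a negative-log inner-product distance, a common choice for posteriorgram matching. The HDPHMM tokenizer and the non-negative matrix factorization step are omitted, and the function names `posteriorgram_distance` and `subsequence_dtw` are hypothetical.

```python
import numpy as np


def posteriorgram_distance(query, utterance, eps=1e-10):
    """Pairwise -log inner-product distance between posteriorgram frames.

    query: (m, k) array, utterance: (n, k) array; each row is a posterior
    distribution over k acoustic units. This distance is an assumption,
    not taken from the paper.
    """
    return -np.log(np.clip(query @ utterance.T, eps, None))


def subsequence_dtw(dist):
    """Match the whole query against any region of the utterance.

    Returns (cost normalized by query length, end frame of the best match).
    Plain subsequence DTW, not the paper's modified SDTW.
    """
    m, n = dist.shape
    acc = np.full((m, n), np.inf)
    acc[0, :] = dist[0, :]  # the match may start at any utterance frame
    for i in range(1, m):
        acc[i, 0] = acc[i - 1, 0] + dist[i, 0]
        for j in range(1, n):
            acc[i, j] = dist[i, j] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1]
            )
    end = int(np.argmin(acc[-1]))  # the match may end at any utterance frame
    return acc[-1, end] / m, end


# Toy data: one-hot "posteriorgrams" over 5 units (illustrative only).
onehot = np.eye(5)
utterance = onehot[[0, 1, 2, 3, 4, 0, 1]]
query = onehot[[2, 3, 4]]
cost, end = subsequence_dtw(posteriorgram_distance(query, utterance))
print(cost, end)
```

In a full system the normalized cost would be compared against a threshold to decide whether the query term occurs in the utterance; the threshold and any score normalization are not given in the abstract.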
