CAO Jian-kai, ZHANG Lian-hai. Query-by-example spoken term detection by applying the HDPHMM tokenizer[J]. JOURNAL OF SIGNAL PROCESSING, 2017, 33(5): 703-710. DOI: 10.16798/j.issn.1003-0530.2017.05.007

Query-by-example spoken term detection by applying the HDPHMM tokenizer

  • This paper studies a hierarchical Dirichlet process hidden Markov model (HDPHMM) approach to unsupervised query-by-example spoken term detection (QbE-STD). First, a hierarchical hidden Markov model is applied, in which the top-layer states represent the acoustic units to be discovered and the bottom-layer states model the emission probabilities of the top-layer states. Imposing a hierarchical Dirichlet process prior on the top-layer states yields a nonparametric Bayesian model, the HDPHMM. After the model is trained on unlabeled speech data, it outputs posteriorgram feature vectors for the test utterances and the query term. The posteriorgram features are then optimized with a non-negative matrix factorization algorithm, and detection is performed with a modified SDTW algorithm. Experimental results show that the proposed method outperforms a baseline system based on a Gaussian mixture model tokenizer and markedly improves detection precision.
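
The matching step described in the abstract can be illustrated with a minimal sketch. It is not the authors' modified SDTW algorithm, whose details the abstract does not give. It assumes a plain subsequence dynamic time warping search, a frame distance of the negative log inner product between posterior vectors (a common choice for posteriorgram matching), and a cost normalized by query length. The function names `frame_distance` and `subsequence_dtw` are hypothetical.

```python
import numpy as np


def frame_distance(query, utterance, eps=1e-10):
    """Pairwise -log(q . u) between posteriorgram frames.

    Both inputs are (frames x units) arrays whose rows sum to 1.
    This distance is an assumption, not taken from the paper.
    """
    similarity = query @ utterance.T
    return -np.log(np.maximum(similarity, eps))


def subsequence_dtw(query, utterance):
    """Find the utterance segment that best matches the whole query.

    Returns (cost, start, end): the accumulated distance divided by the
    query length, and the first and last utterance frame of the match.
    """
    dist = frame_distance(query, utterance)
    n, m = dist.shape
    acc = np.full((n, m), np.inf)       # accumulated path cost
    start = np.zeros((n, m), dtype=int) # utterance frame where each path began

    # A match may begin at any utterance frame at no extra cost.
    acc[0, :] = dist[0, :]
    start[0, :] = np.arange(m)

    for i in range(1, n):
        acc[i, 0] = acc[i - 1, 0] + dist[i, 0]
        start[i, 0] = 0
        for j in range(1, m):
            steps = ((i - 1, j - 1), (i - 1, j), (i, j - 1))
            costs = [acc[p] for p in steps]
            best = int(np.argmin(costs))
            acc[i, j] = costs[best] + dist[i, j]
            start[i, j] = start[steps[best]]

    # A match may end at any utterance frame; take the cheapest one.
    end = int(np.argmin(acc[n - 1, :]))
    return acc[n - 1, end] / n, int(start[n - 1, end]), end
```

In a detection setting, the returned cost would be compared against a threshold to decide whether the query occurs in the utterance; the threshold and any further normalization are left open here.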
