Title :
Unknown-multiple signal source clustering problem using ergodic HMM and applied to speaker classification
Author :
Murakami, J. ; Sugiyama, M. ; Watanabe, Hiromi
Author_Institution :
Inf. & Commun. Syst. Labs., NTT, Japan
Abstract :
The authors consider signals originating from a sequence of sources. More specifically, they address the problems of segmenting such signals and relating the segments to their sources. This issue has wide applications in many fields. The report describes a resolution method based on an ergodic hidden Markov model (HMM), in which each HMM state corresponds to a signal source. The signal source sequence can be determined by applying a decoding procedure (Viterbi algorithm or forward algorithm) to the observed sequence. Baum-Welch training is used to estimate the HMM parameters from the training material. As an example of the multiple signal source classification problem, an experiment is performed on unknown speaker classification. The results show a classification rate of 79% for 4 male speakers. The results also indicate that the model is sensitive to the initial values of the ergodic HMM and that employing the long-distance LPC cepstrum is effective for signal preprocessing.
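The decoding step described in the abstract can be illustrated with a minimal sketch. It assumes a toy setup that is not taken from the paper: scalar observations, one 1-D Gaussian emission per state, and hand-set parameters in place of Baum-Welch estimates (the paper works on LPC cepstrum features, and its emission model is not stated here). The names `gaussian_log_pdf` and `viterbi` are hypothetical helpers. Each state stands for one source, and the ergodic topology allows a transition from any state to any other.

```python
import math


def gaussian_log_pdf(x, mean, var):
    """Log density of a 1-D Gaussian (assumed stand-in for a per-source emission model)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def viterbi(observations, log_init, log_trans, means, variances):
    """Return the most likely state (source) index for each observation."""
    n_states = len(log_init)
    # delta[s]: best log score of any state path that ends in state s at the current frame
    delta = [
        log_init[s] + gaussian_log_pdf(observations[0], means[s], variances[s])
        for s in range(n_states)
    ]
    backpointers = []
    for x in observations[1:]:
        new_delta, pointers = [], []
        for s in range(n_states):
            # Ergodic topology: every state is reachable from every state.
            best_prev = max(range(n_states), key=lambda p: delta[p] + log_trans[p][s])
            pointers.append(best_prev)
            new_delta.append(
                delta[best_prev]
                + log_trans[best_prev][s]
                + gaussian_log_pdf(x, means[s], variances[s])
            )
        delta = new_delta
        backpointers.append(pointers)
    # Trace the best path backwards from the best final state.
    state = max(range(n_states), key=lambda s: delta[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Toy example: two sources with hand-set parameters (illustrative values only).
n = 2
stay = 0.9  # assumed self-transition probability; favours long segments per source
log_trans = [
    [math.log(stay if i == j else (1.0 - stay) / (n - 1)) for j in range(n)]
    for i in range(n)
]
log_init = [math.log(1.0 / n)] * n
observations = [0.1, -0.2, 0.3, 4.9, 5.2, 5.1, 0.0, 0.2]
labels = viterbi(observations, log_init, log_trans, means=[0.0, 5.0], variances=[1.0, 1.0])
print(labels)
```

The returned label sequence gives both the segmentation (where the label changes) and the assignment of each segment to a source. In an unsupervised setting such as the one the abstract describes, the parameters passed to `viterbi` would come from Baum-Welch training rather than being fixed by hand.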
Keywords :
Viterbi decoding; acoustic signal processing; cepstral analysis; estimation theory; hidden Markov models; parameter estimation; pattern classification; speech processing; Baum-Welch training; Viterbi algorithm; decoding procedure; ergodic hidden Markov model; forward algorithm; long-distance LPC cepstrum; male speakers; resolution method; signal preprocessing; signal segmentation; signal source sequence; speaker classification; unknown speaker classification; unknown-multiple signal source clustering problem; Cepstrum; Computer science; Electronic mail; Hidden Markov models; Laboratories; Linear predictive coding; Loudspeakers; Parameter estimation; Speech
Conference_Title :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607294