Title :
A comparative study of speech segmentation and feature extraction on the recognition of different dialects
Author :
Li, Bavy N L ; Liu, James N K
Author_Institution :
Dept. of Comput., Hong Kong Polytech., Kowloon, Hong Kong
Abstract :
Speech is the most intuitive way of communication, except for mutes and deaf-mutes. Hong Kong, a multicultural society, is an ideal place to develop a multi-lingual (Cantonese, Mandarin, and English) ASRVR system. Once this happened, numerous techniques were explored of the three major stages of speech data: segmentation, preprocessing, and recognition. The speech segmentation process includes the convex hull method for finding Smin, SVF for Smax, and normal decomposition for Sopt. The sub-word boundaries are considered by the LBDP-based method. The three proposed feature extraction methods for speech preprocessing are MFCCs, RASTA, and FBDCCs. Speech recognition is the final phase of our system. Approaches including Navie Bayesian classification, HMM with Viterbi algorithm, and backpropagation with a two-hidden-layer structure are studied in this paper. The best performance for multi-lingual recognition in our system can be found as applying MFCCs into HMM, after the segmentation procedure
Keywords :
Bayes methods; backpropagation; cepstral analysis; feature extraction; hidden Markov models; speech recognition; HMM; Hong Kong; Navie Bayesian classification; SVF; Viterbi algorithm; backpropagation; convex hull method; dialect recognition; feature extraction; multi-lingual ASRVR system; multi-lingual recognition; multicultural society; normal decomposition; speech preprocessing; speech recognition; speech segmentation; sub-word boundaries; two-hidden-layer structure; Bayesian methods; Cepstral analysis; Deafness; Dynamic programming; Feature extraction; Hidden Markov models; Image segmentation; Speech analysis; Speech processing; Speech recognition;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.814149