DocumentCode :
2256280
Title :
Statistical dialect classification based on mean phonetic features
Author :
Miller, David R. ; Trischitta, James
Author_Institution :
BBN Hark Syst., Cambridge, MA, USA
Volume :
4
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
2025
Abstract :
Describes work done on a text-dependent method for automatic utterance classification and dialect model selection using mean cepstral and duration features on a per-phoneme basis. From transcribed dialect data, we build a linear discriminant to separate the dialects in feature space. This method is potentially much faster than our previous selection algorithm. We have been able to achieve error rates of 8% for distinguishing Northern US speakers from Southern US speakers, and average error rates of 13% on a variety of finer pairwise dialect discriminations. We also present a description of the training and test corpora collected for this work
Keywords :
feature extraction; linguistics; pattern classification; speech recognition; statistical analysis; Northern US speakers; Southern US speakers; automatic utterance classification; dialect model selection; error rates; linear discriminant; mean phonetic features; pairwise dialect discriminations; phoneme cepstral features; phoneme duration features; statistical dialect classification; test corpus; text-dependent method; training corpus; transcribed dialect data; Automatic speech recognition; Cepstral analysis; Computer aided analysis; Decoding; Error analysis; Hidden Markov models; Humans; Speech recognition; System testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607196
Filename :
607196
Link To Document :
بازگشت