Title :
A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models
Author :
Chen, Nancy F. ; Shen, Wade ; Campbell, Joseph P.
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
Abstract :
We propose supervised and unsupervised learning algorithms to extract dialect discriminating phonetic rules and use these rules to adapt biphones to identify dialects. Despite many challenges (e.g., sub-dialect issues and no word transcriptions), we discovered dialect discriminating biphones compatible with the linguistic literature, while outperforming a baseline monophone system by 7.5% (relative). Our proposed dialect discriminating biphone system achieves similar performance to a baseline all-biphone system despite using 25% fewer biphone models. In addition, our system complements PRLM (Phone Recognition followed by Language Modeling), verified by obtaining relative gains of 15-29% when fused with PRLM. Our work is an encouraging first step towards a linguistically-informative dialect recognition system, with potential applications in forensic phonetics, accent training, and language learning.
Keywords :
speaker recognition; speech processing; unsupervised learning; baseline monophone system; dialect discriminating biphones; dialect-discriminating context-dependent phonetic models; linguistically-informative dialect recognition system; phone recognition-language modeling; supervised learning algorithms; unsupervised learning algorithms; Acoustic noise; Context modeling; Forensics; Hidden Markov models; Humans; Laboratories; Natural languages; Pattern recognition; Speech analysis; Unsupervised learning; dialect recognition; phonetic rules;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495068