Title :
Dialect distance assessment method based on comparison of pitch pattern statistical models
Author :
Mahnoosh Mehrabani;Hynek Bořil;John H.L. Hansen
Author_Institution :
Center for Robust Speech Systems, Erik Jonsson School of Engineering & Computer Science, The University of Texas at Dallas, USA
Abstract :
Dialect variations of a language have a severe impact on the performance of speech systems. Therefore, knowing how close or diverse dialects are in a given language space provides useful information to predict, or improve, system performance when there is a mismatch between train and test data. Distance measures have been used in several applications of speech processing. However, apart from phonetic measures, little if any work has been done on dialect distance measurement. This study explores differences in pitch movement microstructure among dialects. A method of dialect distance assessment based on pitch patterns modeled progressively from pitch contour primitives is proposed. The presented method does not require any manual labeling and is text-independent. The KL divergence is employed to compare the resulting statistical models. The proposed scheme is evaluated on a corpus of Arabic dialects, and shown to be consistent with the results from the spectral-based dialect classification system. Finally, it is also shown using a perceptive evaluation that the proposed objective approach correlates well with subjective distances.
Keywords :
"Natural languages","Speech processing","Testing","Distortion measurement","System performance","Speech recognition","Speech coding","Distance measurement","Feature extraction","Speech synthesis"
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
2379-190X
DOI :
10.1109/ICASSP.2010.5495019