DocumentCode
1351847
Title
A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction
Author
Lin, Chu-Cheng ; Tsai, Richard Tzong-Han
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
20
Issue
4
fYear
2012
fDate
5/1/2012 12:00:00 AM
Firstpage
1109
Lastpage
1117
Abstract
Most spoken Chinese dialects lack comprehensive digital pronunciation databases, which are crucial for speech processing tasks. Given complete pronunciation databases for related dialects, one can use supervised learning techniques to predict a Chinese character´s pronunciation in a target dialect based on the character´s features and its pronunciation in other related dialects. Unfortunately, Chinese dialect pronunciation databases are far from complete. We propose a novel generative model that makes use of both existing dialect pronunciation data plus medieval rime books to discover patterns that exist in multiple dialects. The proposed model can augment missing dialectal pronunciations based on existing dialect pronunciation tables (even if incomplete) and the pronunciation data in rime books. The augmented pronunciation database can then be used in supervised learning settings. We evaluate the prediction accuracy in terms of phonological features, such as tone, initial phoneme, final phoneme, etc. For each character, features are evaluated on the whole, overall pronunciation feature accuracy (OPFA). Our first experimental results show that adding features from dialectal pronunciation data to our baseline rime-book model dramatically improves OPFA using the support vector machine (SVM) model. In the second experiment, we compare the performance of the SVM model using phonological features from closely related dialects with that of the model using phonological features from non-closely related dialects. The experimental results show that using features from closely related dialects results in higher accuracy. In the third experiment, we show that using our proposed data augmentation model to fill in missing data can increase the SVM model´s OPFA by up to 7.6%.
Keywords
data handling; learning (artificial intelligence); natural language processing; speech processing; support vector machines; Chinese dialect pronunciation prediction; OPFA; baseline rime book model; digital pronunciation databases; generative data augmentation model; medieval rime books; missing dialectal pronunciation augmentation; overall pronunciation feature accuracy; pattern discovery; phonological features; prediction accuracy; speech processing; supervised learning; support vector machine model; Books; Data models; Databases; Dictionaries; Speech; Speech processing; Support vector machines; Chinese dialects; data augmentation; generative model; pronunciation database;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2172424
Filename
6047570
Link To Document