Title :
Regularising an adaptation algorithm for tongue shape models
Author :
Farhadloo, Mohsen ; Carreira-Perpiñán, Miguel Á
Author_Institution :
EECS, Univ. of California, Merced, Merced, CA, USA
Abstract :
Realistic data-driven models of the tongue shape can be obtained by learning a nonlinear mapping from tongue landmarks to full contours, trained on a dataset of thousands of contours. Semiautomatic contour extraction from ultrasound takes a lot of time and effort from an expert, so practically it is preferable to adapt a reference model given just a few contours from the new speaker. However, adaptation with very few contours is unreliable and prone to overfitting. We study several forms of regularisation to constrain the adaptation, and determine the optimal amount of regularisation by leave-one-out cross-validation. Our results show that good accuracy models can be found reliably with no user intervention.
Keywords :
speech recognition; adaptation algorithm; leave-one-out cross-validation; nonlinear mapping; realistic data-driven models; reference model; regularisation; semiautomatic contour extraction; speaker; tongue landmarks; tongue shape models; ultrasound; Adaptation models; Biological system modeling; Optimization; Reactive power; Shape; Tongue; Ultrasonic imaging; articulatory databases; model adaptation; regularisation; tongue model; ultrasound;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288915