DocumentCode :
3163955
Title :
Regularising an adaptation algorithm for tongue shape models
Author :
Farhadloo, Mohsen ; Carreira-Perpiñán, Miguel Á
Author_Institution :
EECS, Univ. of California, Merced, Merced, CA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4481
Lastpage :
4484
Abstract :
Realistic data-driven models of the tongue shape can be obtained by learning a nonlinear mapping from tongue landmarks to full contours, trained on a dataset of thousands of contours. Semiautomatic contour extraction from ultrasound takes a lot of time and effort from an expert, so practically it is preferable to adapt a reference model given just a few contours from the new speaker. However, adaptation with very few contours is unreliable and prone to overfitting. We study several forms of regularisation to constrain the adaptation, and determine the optimal amount of regularisation by leave-one-out cross-validation. Our results show that good accuracy models can be found reliably with no user intervention.
Keywords :
speech recognition; adaptation algorithm; leave-one-out cross-validation; nonlinear mapping; realistic data-driven models; reference model; regularisation; semiautomatic contour extraction; speaker; tongue landmarks; tongue shape models; ultrasound; Adaptation models; Biological system modeling; Optimization; Reactive power; Shape; Tongue; Ultrasonic imaging; articulatory databases; model adaptation; regularisation; tongue model; ultrasound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288915
Filename :
6288915
Link To Document :
بازگشت