DocumentCode :
2789044
Title :
Framework for cross-language automatic phonetic segmentation
Author :
Ogbureke, Kalu U. ; Carson-Berndsen, Julie
Author_Institution :
Sch. of Comput. Sci. & Inf., Univ. Coll. Dublin, Dublin, Ireland
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5266
Lastpage :
5269
Abstract :
Annotation of large multilingual corpora remains a challenge to the data-driven approach to speech research, especially for under-resourced languages. This paper presents cross-language automatic phonetic segmentation using Hidden Markov Models (HMMs). The underlying notion is segmentation based on articulation (manner and place) so as to provide extensive models that will be applicable across languages. A test on the Appen Spanish speech corpus gives phone recognition accuracy of 61.15% when bootstrapped with acoustic models trained on the TIMIT as compared with a baseline result of 54.63% for flat start initialization of the monophone models.
Keywords :
hidden Markov models; natural language processing; speech processing; Appen Spanish speech corpus; annotation; cross-language automatic phonetic segmentation; hidden Markov models; multilingual corpora; phone recognition; speech research; Acoustic testing; Automatic speech recognition; Computer science; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Natural languages; Noise robustness; Speech recognition; Speech synthesis; Automatic phonetic segmentation; articulatory features; hidden Markov models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494978
Filename :
5494978
Link To Document :
بازگشت