DocumentCode :
271472
Title :
Statistics of diphones and triphones presence on the word boundaries in the Polish language. Applications to ASR
Author :
Ziółko, Bartosz ; Żelasko, Piotr ; Skurzok, Dawid
Author_Institution :
AGH Univ. of Sci. & Technol., Kraków, Poland
fYear :
2014
fDate :
11-13 April 2014
Firstpage :
1
Lastpage :
6
Abstract :
Recognition of continuous speech is one of the major challenges in automatic speech recognition (ASR), especially in phonetically complex languages (e.g. Polish). To improve ASR of the Polish language, we obtained phoneme statistics to locate diphones and triphones within running speech sequences. We found that these clusters are more likely to occur across word boundaries than within words. Our research identified the most frequently appearing diphones and triphones in the natural speech corpus (Corpora), and we normalized these data for the Polish language at large. The results can be used in various ASR applications, e.g. by the speech recognizer module to enhance word boundary recognition, or to recognize non-dictionary words embedded in a natural sentence (e.g. proper names).
Keywords :
natural language processing; speech recognition; statistical analysis; ASR; Polish language; automatic speech recognition; diphones; natural sentence; natural speech corpus; nondictionary word; phoneme statistics; phonetically complex language; triphones; word boundary recognition; Automatic speech recognition; Educational institutions; Electronic mail; Probability; Speech; Speech processing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Pacific Voice Conference (PVC), 2014 XXII Annual
Conference_Location :
Krakow
Print_ISBN :
978-1-4799-3699-1
Type :
conf
DOI :
10.1109/PVC.2014.6845418
Filename :
6845418