DocumentCode
575629
Title
Statistical phonetic analysis of the Romanian language for speech recognition and synthesis tasks
Author
Stanescu, Miruna ; Buzo, Andi ; Cucu, H. ; Burileanu, C.
Author_Institution
Univ. Politeh. of Bucharest, Bucharest, Romania
fYear
2012
fDate
12-14 Sept. 2012
Firstpage
219
Lastpage
222
Abstract
This article provides a statistical phonetic analysis based on the largest Romanian text corpus collected so far for research purposes. Several types of phonetic events are analyzed: phones, diphones, triphones, and phone clusters based on the general classification of phones in the Romanian language. Some interesting conclusions are drawn, such as the fact that fewer than half of the diphones cover 99% of the whole text. The article also discusses some uses of these phonetic statistics for spoken language technology tasks.
Keywords
speech processing; statistical analysis; Romanian language; Romanian text corpus; diphones; general classification; phone clusters; phonetic events; phonetic statistics; speech recognition; speech synthesis tasks; spoken language technology tasks; statistical phonetic analysis; triphones; Automatic speech recognition; Buildings; Databases; Speech; Speech processing; Training; Phonetic event; Spoken language technology; Text-to-speech
fLanguage
English
Publisher
IEEE
Conference_Titel
ELMAR, 2012 Proceedings
Conference_Location
Zadar
ISSN
1334-2630
Print_ISBN
978-1-4673-1243-1
Type
conf
Filename
6338510