DocumentCode :
3302206
Title :
Romanian Spoken Language Resources and Annotation for Speaker Independent Spontaneous Speech Recognition
Author :
Burileanu, Corneliu ; Buzo, Andi ; Petre, Cristina Sorina ; Ghelmez-hanes, Diana ; Cucu, Horia
Author_Institution :
Fac. of Electron., Telecommun. & Inf. Technol., Univ. Politeh. of Bucharest, Bucharest, Romania
fYear :
2010
fDate :
13-19 June 2010
Firstpage :
7
Lastpage :
10
Abstract :
This paper presents studies and early results with the scope to build a robust spontaneous speech recognition system in Romanian language. We have tried to give solutions to several issues that have arisen like building a large and accurate database within a reasonable time. A short description of the database is given and some statistics are collected in order to show its evolution in several stages of the project. Embedded training technique has been used for training triphones. As a consequence, the alignment problem has been studied and a solution is proposed for it. The final purpose of these attempts is to obtain substantial results in speech recognition for Romanian language that can be used as baseline for further results.
Keywords :
Databases; Information technology; Loudspeakers; Natural languages; Robustness; Scalability; Speech recognition; Statistics; TV broadcasting; Vocabulary; Embedded training; Romanian Triphonest Corpus; Spontaneous Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Telecommunications (ICDT), 2010 Fifth International Conference on
Conference_Location :
Athens, TBD, Greece
Print_ISBN :
978-1-4244-7271-0
Type :
conf
DOI :
10.1109/ICDT.2010.9
Filename :
5532390
Link To Document :
بازگشت