DocumentCode :
3702120
Title :
Large scale data based linguistic investigations using speech technology tools: The case of Romanian
Author :
Ioana Vasilescu;Camille Dutrey;Lori Lamel
Author_Institution :
LIMSI, CNRS, Paris-Saclay University, B?t 508, Campus Universitaire, F-91405 Orsay France
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
This paper provides a summary of previous efforts made to build an ASR system for Romanian. Thereafter, the data developed within the ASR framework are used to conduct linguistic studies. A first study is dedicated to morpho-phonetic processes in Romanian such as the deletion of masculine definite article -l and the realization of the word final palatalized consonants as plural marker in nouns and person marker in verb conjugation. Data shows that the two phenomena are variable in continuous speech and depend on the degree of spontaneity of the corpus. The second study is dedicated to Romanian vowels acoustic properties. This study takes into account a 7 hours corpus used as development and evaluation data to build the ASR system. Data confirm a seven-vowel system. They also highlight an acoustic proximity and a complementary distribution of the non low central vowels [] and [Λ]. The current findings support previous hypotheses built from laboratory data investigations and encourage further explorations on large scale data.
Keywords :
"Microwave integrated circuits","Lungs","Context"
Publisher :
ieee
Conference_Titel :
Speech Technology and Human-Computer Dialogue (SpeD), 2015 International Conference on
Type :
conf
DOI :
10.1109/SPED.2015.7343108
Filename :
7343108
Link To Document :
بازگشت