DocumentCode :
2174726
Title :
Automatic estimation of the second subglottal resonance from natural speech
Author :
Arsikere, Harish ; Lulich, Steven M. ; Alwan, Abeer
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, CA, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4616
Lastpage :
4619
Abstract :
This paper deal s with the automatic estimation of the second subglottal resonance (Sg2) from natural speech spoken by adults, since our previous work focused only on estimating Sg2 from isolated diphthongs. A new database comprising speech and subglottal data of native American English (AE) speakers and bilingual Spanish/English speakers was used for the analysis. Data from 11 speakers (6 females and 5 males) were used to derive an empirical relation among the second and third formant frequencies (F2 and F3) and Sg2. Using the derived relation, Sg2 was automatically estimated from voiced sounds in English and Spanish sentences spoken by 20 different speakers (10 males and 10 females). On average, the error in estimating Sg2 was less than 100 Hz in at least 9 isolated AE vowels and less than 40 Hz in continuous speech consisting of English or Spanish sentences.
Keywords :
natural language processing; speech processing; AE speaker; American English speaker; Sg2 estimation; automatic estimation; bilingual Spanish-English speaker; natural speech; second subglottal resonance estimation; Acoustics; Databases; Estimation error; Microphones; Speech; Training; automatic estimation; bilingual speech; speaker normalization; subglottal resonances;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947383
Filename :
5947383
Link To Document :
بازگشت