Title :
Automatic estimation of the second subglottal resonance from natural speech
Author :
Arsikere, Harish ; Lulich, Steven M. ; Alwan, Abeer
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, CA, USA
Abstract :
This paper deals with the automatic estimation of the second subglottal resonance (Sg2) from natural speech spoken by adults, since our previous work focused only on estimating Sg2 from isolated diphthongs. A new database comprising speech and subglottal data of native American English (AE) speakers and bilingual Spanish/English speakers was used for the analysis. Data from 11 speakers (6 females and 5 males) were used to derive an empirical relation among the second and third formant frequencies (F2 and F3) and Sg2. Using the derived relation, Sg2 was automatically estimated from voiced sounds in English and Spanish sentences spoken by 20 different speakers (10 males and 10 females). On average, the error in estimating Sg2 was less than 100 Hz in at least 9 isolated AE vowels and less than 40 Hz in continuous speech consisting of English or Spanish sentences.
Keywords :
natural language processing; speech processing; AE speaker; American English speaker; Sg2 estimation; automatic estimation; bilingual Spanish-English speaker; natural speech; second subglottal resonance estimation; Acoustics; Databases; Estimation error; Microphones; Speech; Training; bilingual speech; speaker normalization; subglottal resonances
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947383