Automatic estimation of the second subglottal resonance from natural speech

Author

Arsikere, Harish ; Lulich, Steven M. ; Alwan, Abeer

Author_Institution

Dept. of Electr. Eng., Univ. of California, Los Angeles, CA, USA

fYear

2011

fDate

22-27 May 2011

Firstpage

4616

Lastpage

4619

Abstract

This paper deals with the automatic estimation of the second subglottal resonance (Sg2) from natural speech spoken by adults, extending our previous work, which focused only on estimating Sg2 from isolated diphthongs. A new database comprising speech and subglottal data of native American English (AE) speakers and bilingual Spanish/English speakers was used for the analysis. Data from 11 speakers (6 females and 5 males) were used to derive an empirical relation among the second and third formant frequencies (F2 and F3) and Sg2. Using the derived relation, Sg2 was automatically estimated from voiced sounds in English and Spanish sentences spoken by 20 different speakers (10 males and 10 females). On average, the error in estimating Sg2 was less than 100 Hz in at least 9 isolated AE vowels and less than 40 Hz in continuous speech consisting of English or Spanish sentences.

Keywords

natural language processing; speech processing; AE speaker; American English speaker; Sg2 estimation; automatic estimation; bilingual Spanish-English speaker; natural speech; second subglottal resonance estimation; Acoustics; Databases; Estimation error; Microphones; Speech; Training; bilingual speech; speaker normalization; subglottal resonances

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location

Prague

ISSN

1520-6149

Print_ISBN

978-1-4577-0538-0

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2011.5947383

Filename

5947383