• DocumentCode
    2174726
  • Title
    Automatic estimation of the second subglottal resonance from natural speech
  • Author
    Arsikere, Harish ; Lulich, Steven M. ; Alwan, Abeer
  • Author_Institution
    Dept. of Electr. Eng., Univ. of California, Los Angeles, CA, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4616
  • Lastpage
    4619
  • Abstract
    This paper deals with the automatic estimation of the second subglottal resonance (Sg2) from natural speech spoken by adults, since our previous work focused only on estimating Sg2 from isolated diphthongs. A new database comprising speech and subglottal data of native American English (AE) speakers and bilingual Spanish/English speakers was used for the analysis. Data from 11 speakers (6 females and 5 males) were used to derive an empirical relation among the second and third formant frequencies (F2 and F3) and Sg2. Using the derived relation, Sg2 was automatically estimated from voiced sounds in English and Spanish sentences spoken by 20 different speakers (10 males and 10 females). On average, the error in estimating Sg2 was less than 100 Hz in at least 9 isolated AE vowels and less than 40 Hz in continuous speech consisting of English or Spanish sentences.
  • Keywords
    natural language processing; speech processing; AE speaker; American English speaker; Sg2 estimation; automatic estimation; bilingual Spanish-English speaker; natural speech; second subglottal resonance estimation; Acoustics; Databases; Estimation error; Microphones; Speech; Training; automatic estimation; bilingual speech; speaker normalization; subglottal resonances;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2011.5947383
  • Filename
    5947383