• DocumentCode
    3255261
  • Title
    On the use of stress information in speech for speaker recognition
  • Author
    Narayana, Laxmi M. ; Kopparapu, Sunil Kumar
  • Author_Institution
    TCS Innovation Lab. - Mumbai, Tata Consultancy Services, Thane, India
  • fYear
    2009
  • fDate
    23-26 Jan. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The performance of a speaker recognition system degrades when the speaker is under stress or emotion. In this paper, we explore and identify a mechanism that uses the inherent stress-in-speech, or speaking-style, information present in a person's speech as an additional cue for speaker recognition. We quantify the inherent stress in a speaker's speech mainly using three features, namely pitch, amplitude, and duration (together called PAD). We experimentally observe that the PAD vectors of similar phones in different words of a speaker are close to each other in the three-dimensional PAD space, confirming that the way a speaker stresses different syllables in their speech is unique to them. We therefore propose the use of the PAD-based speaking style of a speaker as an additional feature for speaker recognition applications.
  • Keywords
    speaker recognition; speech processing; PAD based speaking style; PAD vectors; amplitude; duration; inherent stress; pitch; speaker recognition system; speaking style information; stress information; stress-in-speech; Atherosclerosis; Automatic speech recognition; Human factors; Mel frequency cepstral coefficient; Psychology; Speaker recognition; Speech analysis; Speech recognition; Stress; Technological innovation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    TENCON 2009 - 2009 IEEE Region 10 Conference
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-4546-2
  • Electronic_ISBN
    978-1-4244-4547-9
  • Type
    conf
  • DOI
    10.1109/TENCON.2009.5396003
  • Filename
    5396003