• DocumentCode
    730666
  • Title

    Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification

  • Author

    Prasad, Abhay ; Periyasamy, Vijitha ; Ghosh, Prasanta Kumar

  • Author_Institution
    Manipal Inst. of Technol., Manipal, India
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4265
  • Lastpage
    4269
  • Abstract
    Speech articulation varies across speakers for producing a speech sound due to the differences in their vocal tract morphologies, though the speech motor actions are executed in terms of relatively invariant gestures [1]. While the invariant articulatory gestures are driven by the linguistic content of the spoken utterance, the component of speech articulation that varies across speakers reflects speaker-specific and other paralinguistic information. In this work, we present a formulation to decompose the speech articulation from multiple speakers into the variant and invariant aspects when they speak the same sentence. The variant component is found to be a better representation for discriminating speakers compared to the speech articulation which includes the invariant part. Experiments with real-time magnetic resonance imaging (rtMRI) videos of speech production from multiple speakers reveal that the variant component of speech articulation yields a better frame-level speaker identification accuracy compared to the speech articulation as well as acoustic features by 29.9% and 9.4% (absolute) respectively.
  • Keywords
    magnetic resonance imaging; signal representation; sound reproduction; speaker recognition; discriminating speaker representation; frame-level speaker identification; invariant estimation; real-time magnetic resonance imaging; rtMRI; speaker identification; speech articulation; speech production; spoken utterance linguistic content; videos; Accuracy; Acoustics; Estimation; Linear programming; Speech; Speech processing; Subspace constraints; invariant gestures; speaker identification; speech articulation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178775
  • Filename
    7178775