• DocumentCode
    2834622
  • Title

    Neural network models for combining evidence from spectral and suprasegmental features for text-dependent speaker verification

  • Author

    Prasanna, S. R Mahadeva ; Zachariah, Jinu Mariam ; Yegnanarayana, B.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
  • fYear
    2004
  • fDate
    2004
  • Firstpage
    359
  • Lastpage
    363
  • Abstract
    This paper proposes a method using neural network models for combining evidence from spectral and suprasegmental features for text-dependent speaker verification. Spectral features are extracted using the Dynamic Time Warping (DTW) technique. While extracting the spectral features, the DTW algorithm is used only to obtain a matching score and the information present in the warping path is ignored. In this work a method is discussed to extract suprasegmental features such as pitch and duration using the information in the warping path. Although the suprasegmental features may not yield good performance, combining the evidence from suprasegmental and spectral features improves the performance of the speaker verification system significantly.
  • Keywords
    feature extraction; neural nets; speaker recognition; dynamic time warping technique; neural network model; pitch extraction; spectral feature extraction; suprasegmental feature extraction; text dependent speaker verification; Biometrics; Computer science; Data mining; Feature extraction; Laboratories; Neural networks; Speaker recognition; Speech analysis; Speech processing; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
  • Print_ISBN
    0-7803-8243-9
  • Type

    conf

  • DOI
    10.1109/ICISIP.2004.1287683
  • Filename
    1287683