• DocumentCode
    2239577
  • Title

    CSR: Speaker Recognition from Compressed VoIP Packet Stream

  • Author

    Aggarwal, Charu ; Olshefski, David ; Saha, Debanjan ; Shae, Zon-Yin ; Yu, Philip

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Hawthorne, NY
  • fYear
    2005
  • fDate
    6-6 July 2005
  • Firstpage
    970
  • Lastpage
    973
  • Abstract
    VoIP applications require the ability to identify speakers in real time. This paper presents compressed speaker recognition (CSR), an innovative approach to perform speaker recognition directly from the compressed voice packets. CSR performs online speaker recognition from live packet streams of compressed voice packets by performing fast clustering over a defined subset of the features available in each compressed voice packet. Our experimental results show that CSR is highly scalable and accurate across a broad range of speakers
  • Keywords
    Internet telephony; data compression; speaker recognition; CSR; VoIP application; compressed speaker recognition; compressed voice packet; fast clustering; innovative approach; Authentication; Delay; Feature extraction; Filters; Frequency; Internet telephony; Signal analysis; Speaker recognition; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
  • Conference_Location
    Amsterdam
  • Print_ISBN
    0-7803-9331-7
  • Type

    conf

  • DOI
    10.1109/ICME.2005.1521586
  • Filename
    1521586