CSR: Speaker Recognition from Compressed VoIP Packet Stream

Author

Aggarwal, Charu ; Olshefski, David ; Saha, Debanjan ; Shae, Zon-Yin ; Yu, Philip

Author_Institution

IBM Thomas J. Watson Res. Center, Hawthorne, NY

fYear

2005

fDate

6-6 July 2005

Firstpage

970

Lastpage

973

Abstract

VoIP applications require the ability to identify speakers in real time. This paper presents compressed speaker recognition (CSR), an innovative approach to perform speaker recognition directly from the compressed voice packets. CSR performs online speaker recognition from live packet streams of compressed voice packets by performing fast clustering over a defined subset of the features available in each compressed voice packet. Our experimental results show that CSR is highly scalable and accurate across a broad range of speakers

Keywords

Internet telephony; data compression; speaker recognition; CSR; VoIP application; compressed speaker recognition; compressed voice packet; fast clustering; innovative approach; Authentication; Delay; Feature extraction; Filters; Frequency; Internet telephony; Signal analysis; Speaker recognition; Speech analysis; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on

Conference_Location

Amsterdam

Print_ISBN

0-7803-9331-7

Type

conf

DOI

10.1109/ICME.2005.1521586

Filename

1521586

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2239577