Filter bank based cepstral features for speaker recognition

Author

Chougule, S.V. ; Chavan, M.S. ; Gaikwad, M.S.

Author_Institution

Finolex Acad. of Manage. & Technol., Ratnagiri, India

fYear

2014

fDate

22-24 Dec. 2014

Firstpage

102

Lastpage

106

Abstract

Speaker recognition in mismatched condition is a vital issue in the recent years as training and testing speech data can be distorted due various practical conditions. Filter bank based cepstral features are used in many speech, speaker and language recognition tasks. These filter banks are mainly designed according to auditory based principles of speech processing, with variation in shape of filters and localization of their frequencies (center, left and right). The set of band pass filters can capture the information related to human vocal tract, which is one of the main distinguishing characteristics of individual. The non-uniform nature of (such as mel-scale warped) filter bank may cause loss of information in high frequency bands, which may carry some speaker specific information. Therefore uniformly (linearly) spaced filter bank cepstral coefficients can capture better speaker specific information, especially in mismatched conditions. In this paper, cepstral features derived from known psycho-acoustic filter banks (called MFCCs and BFCCs,) are compared with uniformly spaced filter bank cepstral coefficients (UFCCs) for text-dependent and text-independent cases. Experimental results shows that BFCCs are better in text-dependent case, whereas UFCC features give improved results than conventional MFCCs in case of text independent case, in mismatch condition. Results indicate that nature of filter bank plays an important role in extracting the speaker relevant features.

Keywords

band-pass filters; channel bank filters; feature extraction; speaker recognition; BFCCs; MFCCs; UFCCs; auditory based principles; band pass filters; filter bank based cepstral features; human vocal tract; language recognition; psycho-acoustic filter banks; speaker recognition; speech data testing; speech data training; speech processing; uniformly spaced filter bank cepstral coefficients; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; BFCCs; Cepstral features; MFCCs; UFCCs;

fLanguage

English

Publisher

ieee

Conference_Titel

Wireless Computing and Networking (GCWCN), 2014 IEEE Global Conference on

Conference_Location

Lonavala

Type

conf

DOI

10.1109/GCWCN.2014.7030857

Filename

7030857