مرکز منطقه ای اطلاع رساني علوم و فناوري - Modelling Mutual Information between Voiceprint and Optimal Number of Mel-Frequency Cepstral Coefficients in Voice Discrimination

DocumentCode :

259558

Title :

Modelling Mutual Information between Voiceprint and Optimal Number of Mel-Frequency Cepstral Coefficients in Voice Discrimination

Author :

Lin, Kin Wah Edward ; Tian Feng ; Agus, Natalie ; So, Clifford ; Lui, Simon

Author_Institution :

Singapore Univ. of Technol. & Design, Singapore, Singapore

fYear :

2014

fDate :

3-6 Dec. 2014

Firstpage :

Lastpage :

Abstract :

In this paper, we study the relationship between the voiceprint and the optimal number of Mel-frequency Cepstral Coefficients (MFCCs). The voiceprint is modelled as sub-MFCCs matrix with the first d number of MFCCs. We model the relationship through information theory and formulate it as the mutual information maximization problem subject to the probabilities constraint. The solution of this optimization problem provides the optimal number of MFCCs, D among these d, which yields the highest classification accuracy of the voice discrimination, together with a confidence level. This study is dictated by the need to understand the use of MFCCs, which have proliferated since its invention to discriminate voice. We evaluate our model by comparing the leave-one-out cross validation (LOOCV) results of usual multi-class classifier, the Supervised Learning Gaussian Mixture Model (SLGMM), with a set of spoken words and A capella solo vocal performances. The experimental results show that our model is a more comprehensive feature selection criteria for the MFCCs than the de-facto technique, LOOCV.

Keywords :

Gaussian processes; audio signal processing; learning (artificial intelligence); optimisation; signal classification; LOOCV; Mel-frequency cepstral coefficients; SLGMM classifier; classification accuracy; feature selection criteria; information theory; leave-one-out cross validation; multiclass classifier; mutual information maximization problem; mutual information modeling; probabilities constraint; sub-MFCC matrix; supervised learning Gaussian mixture model; voice discrimination; Accuracy; Analytical models; Mutual information; Predictive models; Spectrogram; Training; MFCCs; features selection; information theory; voiceprint;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Applications (ICMLA), 2014 13th International Conference on

Conference_Location :

Detroit, MI

Type :

conf

DOI :

10.1109/ICMLA.2014.9

Filename :

7033085

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=259558