DocumentCode :
259558
Title :
Modelling Mutual Information between Voiceprint and Optimal Number of Mel-Frequency Cepstral Coefficients in Voice Discrimination
Author :
Lin, Kin Wah Edward ; Tian Feng ; Agus, Natalie ; So, Clifford ; Lui, Simon
Author_Institution :
Singapore Univ. of Technol. & Design, Singapore, Singapore
fYear :
2014
fDate :
3-6 Dec. 2014
Firstpage :
15
Lastpage :
20
Abstract :
In this paper, we study the relationship between the voiceprint and the optimal number of Mel-frequency Cepstral Coefficients (MFCCs). The voiceprint is modelled as sub-MFCCs matrix with the first d number of MFCCs. We model the relationship through information theory and formulate it as the mutual information maximization problem subject to the probabilities constraint. The solution of this optimization problem provides the optimal number of MFCCs, D among these d, which yields the highest classification accuracy of the voice discrimination, together with a confidence level. This study is dictated by the need to understand the use of MFCCs, which have proliferated since its invention to discriminate voice. We evaluate our model by comparing the leave-one-out cross validation (LOOCV) results of usual multi-class classifier, the Supervised Learning Gaussian Mixture Model (SLGMM), with a set of spoken words and A capella solo vocal performances. The experimental results show that our model is a more comprehensive feature selection criteria for the MFCCs than the de-facto technique, LOOCV.
Keywords :
Gaussian processes; audio signal processing; learning (artificial intelligence); optimisation; signal classification; LOOCV; Mel-frequency cepstral coefficients; SLGMM classifier; classification accuracy; feature selection criteria; information theory; leave-one-out cross validation; multiclass classifier; mutual information maximization problem; mutual information modeling; probabilities constraint; sub-MFCC matrix; supervised learning Gaussian mixture model; voice discrimination; Accuracy; Analytical models; Mutual information; Predictive models; Spectrogram; Training; MFCCs; features selection; information theory; voiceprint;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2014 13th International Conference on
Conference_Location :
Detroit, MI
Type :
conf
DOI :
10.1109/ICMLA.2014.9
Filename :
7033085
Link To Document :
بازگشت