Title :
Speaker independent discriminant feature extraction for acoustic pattern-matching
Author_Institution :
Telefonica Res., Barcelona, Spain
Abstract :
Acoustic pattern-matching algorithms have recently become prominent again for automatically processing speech utterances where no prior knowledge of the spoken language is required. Applications of such technology include, but are not limited to, query-by-example search, spoken term detection and automatic word discovery. Obtaining content-aware acoustic features as independent as possible from speaker and acoustic environment variations is a key step in these algorithms. Currently, GMM posteriorgrams are found to outperform the standard MFCC features even though they were not designed to optimize the discrimination between acoustic classes. In this paper we combine the K-means clustering algorithm with the GMM posteriorgrams front-end to obtain more discriminant features. Results on a query-by-example task show that the proposed approaches outperform standard MFCC features by 7.8% absolute P@N and GMM-based posteriorgram features by 3.7% absolute P@N when using a 64-dimensional feature vector.
Keywords :
Gaussian distribution; feature extraction; pattern clustering; speaker recognition; speech processing; 64-dimensional feature vector; GMM posteriorgrams; GMM-based posteriorgram features; K-means clustering; acoustic environment variations; acoustic pattern-matching; automatic processing; automatic word discovery; content-aware acoustic features; query-by-example search; speaker environment variations; speaker independent discriminant feature extraction; speech utterances; spoken term detection; standard MFCC features; Acoustics; Clustering algorithms; Data models; Speech; Standards; Training; Vectors; Pattern-matching; k-means; query-by-example; word discovery;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287922