Title :
An SVD-based scheme for MFCC compression in distributed speech recognition system
Author :
Touazi, Azzedine ; Debyeche, Mohamed
Author_Institution :
Signal Process. & Speech Commun. Lab., Univ. of Sci. & Technol. Houari Boumediene, Algiers, Algeria
Abstract :
This paper proposes a new scheme for low bit-rate source coding of Mel Frequency Cepstral Coefficients (MFCCs) in Distributed Speech Recognition (DSR) systems. The method uses the compressed ETSI Advanced Front-End (ETSI-AFE) features factorized into SVD components. Exploiting the correlation between successive MFCC frames, the odd frames are encoded using ETSI-AFE, while for the even frames only the singular values and the index of the nearest left singular vectors are encoded and transmitted. At the server side, the non-transmitted MFCCs are estimated from their quantized singular values and the nearest left singular vectors. The system achieves a bit-rate of 2.7 kbps. Recognition experiments were carried out on the Aurora-2 database in clean and multi-condition training modes. The simulation results show good recognition performance, without significant degradation with respect to the ETSI-AFE encoder.
Keywords :
data compression; singular value decomposition; speech recognition; Aurora-2 database; DSR system; ETSI-AFE encoder; ETSI-AFEw; MFCC compression; Mel frequency cepstral coefficients; SVD based scheme; SVD components; bit rate source coding; compressed ETSI advanced front end; distributed speech recognition; distributed speech recognition system; Indexes; Mel frequency cepstral coefficient; Speech; Speech recognition; Telecommunication standards; Training; Vectors; Distributed speech recognition; ETSI-AFE standard; MFCC coefficients; SVD decomposition;
Conference_Title :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707738