مرکز منطقه ای اطلاع رساني علوم و فناوري - Class-dependent and differential Huffman coding of compressed feature parameters for distributed speech recognition

DocumentCode :

3528561

Title :

Class-dependent and differential Huffman coding of compressed feature parameters for distributed speech recognition

Author :

Lee, Young Han ; Kim, Deok Su ; Kim, Hong Kook

Author_Institution :

Dept. of Inf. & Commun., Gwangju Inst. of Sci. & Technol., Gwangju

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

4165

Lastpage :

4168

Abstract :

In this paper, we propose an entropy coding method for compressing quantized mel-frequency cepstral coefficients (MFCCs) used for distributed speech recognition (DSR). In the European Telecommunication Standards Institute (ETSI) extended DSR standard, MFCCs are compressed with additional parameters such as pitch and voicing class. The entropy of compressed MFCCs in each analysis frame varies according to the voicing class of the frame, thereby enabling the design of different Huffman trees for MFCCs according to voicing class, referred to here as class-dependent Huffman coding. In addition to the voicing class, the correlation in subvector-wise is utilized for Huffman coding, which is called subvector-wise Huffman coding. It is also explored that differential Huffman coding can further enhance a coding gain against class-dependent Huffman coding and subvector-wise Huffman coding. Based on the benefits above, hybrid types of Huffman coding by combining class-dependent and subvector-wise with differential Huffman coding are compared in this paper. Subsequent experiments show that the average bitrate of subvector-wise differential Huffman coding is measured at 33.93 bits/frame, whereas that of a traditional Huffman coding which does not consider voicing class and encodes with a single Huffman coding tree for all the subvectors is at 42.22 bits/frame.

Keywords :

Huffman codes; cepstral analysis; speech coding; speech recognition; class-dependent coding; compressed feature parameters; differential Huffman coding; distributed speech recognition; entropy coding method; quantized mel-frequency cepstral coefficients; subvector-wise Huffman coding; Bit rate; Cepstral analysis; Discrete cosine transforms; Distributed computing; Entropy coding; Huffman coding; Mel frequency cepstral coefficient; Network servers; Speech recognition; Telecommunication standards; Distributed speech recognition; Huffman coding; MFCC; class-dependent; differential coding; subvector-wise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4960546

Filename :

4960546

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3528561