مرکز منطقه ای اطلاع رساني علوم و فناوري - Histogram-Based Quantization for Robust and/or Distributed Speech Recognition

DocumentCode :

1131682

Title :

Histogram-Based Quantization for Robust and/or Distributed Speech Recognition

Author :

Wan, Chia-yu ; Lee, Lin-shan

Author_Institution :

Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei

Volume :

Issue :

fYear :

2008

fDate :

5/1/2008 12:00:00 AM

Firstpage :

859

Lastpage :

873

Abstract :

In a distributed speech recognition (DSR) framework, the speech features are quantized and compressed at the client and recognized at the server. However, recognition accuracy is degraded by environmental noise at the input, quantization distortion, and transmission errors. In this paper, histogram-based quantization (HQ) is proposed, in which the partition cells for quantization are dynamically defined by the histogram or order statistics of a segment of the most recent past values of the parameter to be quantized. This scheme is shown to be able to solve to a good degree many problems related to DSR. A joint uncertainty decoding (JUD) approach is further developed to consider the uncertainty caused by both environmental noise and quantization errors. A three-stage error concealment (EC) framework is also developed to handle transmission errors. The proposed HQ is shown to be an attractive feature transformation approach for robust speech recognition outside of a DSR environment as well. All the claims have been verified by experiments using the Aurora 2 testing environment, and significant performance improvements for both robust and/or distributed speech recognition over conventional approaches have been achieved.

Keywords :

decoding; feature extraction; quantisation (signal); speech recognition; Aurora 2 testing environment; distributed speech recognition framework; environmental noise; feature transformation approach; histogram-based quantization; joint uncertainty decoding approach; partition cells; quantization distortion; quantization errors; robust speech recognition; speech features; three-stage error concealment framework; transmission errors; Decoding; Degradation; Histograms; Noise robustness; Quantization; Speech recognition; Statistics; Testing; Uncertainty; Working environment noise; Error compensation; robustness; speech recognition; vector quantization (VQ);

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.920891

Filename :

4490004

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1131682