مرکز منطقه ای اطلاع رساني علوم و فناوري - A comparison of several acoustic representations for speech recognition with degraded and undegraded speech

DocumentCode :

3521712

Title :

A comparison of several acoustic representations for speech recognition with degraded and undegraded speech

Author :

Hunt, Melvyn J. ; Lefèbvre, Claude

Author_Institution :

Nat. Res. Council of Canada, Ottawa, Ont., Canada

fYear :

1989

fDate :

23-26 May 1989

Firstpage :

262

Abstract :

Several acoustic representations have been compared in speaker-dependent and independent connected and isolated-word recognition tests with undegraded speech and with speech degraded by adding white noise and by applying a 6-dB/octave spectral tilt. The representations comprised the output of an auditory model, cepstrum coefficients derived from an FFT-based mel-scale filter bank with various weighting schemes applied to the coefficients, cepstrum coefficients augmented with measures of their rates of change with time, and sets of linear discriminant functions derived from the filter-bank output and called IMELDA. The model outperformed the cepstrum representations except in noise-free connected-word tests, where it had a high insertion rate. The best cepstrum weighting scheme was derived from within-class variances. Its behavior may explain the empirical adjustments found necessary with other schemes. IMELDA outperformed all other representations in all conditions and is computationally simple

Keywords :

speech recognition; FFT-based mel-scale filter bank; IMELDA; acoustic representations; auditory model; cepstrum coefficients; degraded; independent connected; insertion rate; isolated-word recognition tests; linear discriminant functions; speaker-dependent; spectral tilt; speech recognition; undegraded speech; weighting schemes; white noise; within-class variances; Acoustic testing; Cepstrum; Councils; Degradation; Filter bank; Frequency; Speech enhancement; Speech recognition; Time measurement; White noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on

Conference_Location :

Glasgow

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.1989.266415

Filename :

266415

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3521712