مرکز منطقه ای اطلاع رساني علوم و فناوري - Suprasegmental features and continuous speech recognition

DocumentCode :

290355

Title :

Suprasegmental features and continuous speech recognition

Author :

Dumouchel, P.

Author_Institution :

INRS-Telecommun., Quebec Univ., Verdun, Que., Canada

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

We first propose to model microprosody by means of a Bayesian classifier assuming multivariate Gaussian distributions on suprasegmental features. Second, we normalize the suprasegmental features by using dynamic parameters extracted from diphones. Third, we examine three different types of covariance matrices and show that a full covariance matrix per diphone gives the best results. Finally, the insertion of the microprosodic model into the INRS large vocabulary speech recognition improves the word recognition slightly from 48% to 52%

Keywords :

Bayes methods; Gaussian distribution; Gaussian processes; covariance matrices; feature extraction; pattern classification; speech recognition; Bayesian classifier; INRS large vocabulary speech recognition; continuous speech recognition; covariance matrices; diphones; dynamic parameters; feature extraction; microprosodic model; microprosody; multivariate Gaussian distributions; suprasegmental features; word recognition; Delay; Disk recording; Face; Filters; Frequency; Hidden Markov models; Speech processing; Speech recognition; Stress; Viterbi algorithm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389690

Filename :

389690

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290355