مرکز منطقه ای اطلاع رساني علوم و فناوري - Analysis-by-synthesis features for speech recognition

DocumentCode :

3422933

Title :

Analysis-by-synthesis features for speech recognition

Author :

Bawab, Ziad Al ; Raj, Bhiksha ; Stern, Richard M.

Author_Institution :

Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA

fYear :

2008

fDate :

March 31 2008-April 4 2008

Firstpage :

4185

Lastpage :

4188

Abstract :

We present a framework for speech recognition that accounts for hidden articulatory information. We model the articulatory space using a codebook of articulatory configurations geometrically derived from EMA measurements available in the MOCHA database. The articulatory parameter set we derive is in the form of Maeda parameters. In turn, these parameters are used in a physiologically- motivated articulatory speech synthesizer based on the model by Sondhi and Schroeter. We use the distortion between the speech synthesized from each of the articulatory configurations and the original speech as features for recognition. We setup a segmented phoneme recognition task on the MOCHA database using Gaussian mixture models (GMMs). Improvements are achieved when combining the probability scores generated using the distortion features with the scores using acoustic features.

Keywords :

Gaussian processes; speech recognition; speech synthesis; Gaussian mixture models; MOCHA database; analysis-by-synthesis features; hidden articulatory information; physiologically-motivated articulatory speech synthesizer; segmented phoneme recognition task; speech recognition; Acoustic distortion; Cepstral analysis; Physics; Signal generators; Signal synthesis; Spatial databases; Speech analysis; Speech recognition; Speech synthesis; Trajectory; Articulatory Recognition; Articulatory Synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location :

Las Vegas, NV

ISSN :

1520-6149

Print_ISBN :

978-1-4244-1483-3

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2008.4518577

Filename :

4518577

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3422933