مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning vocal tract variables with multi-task kernels

DocumentCode :

2162878

Title :

Learning vocal tract variables with multi-task kernels

Author :

Kadri, Hachem ; Duflos, Emmanuel ; Preux, Philippe

Author_Institution :

Team-Project SequeL, INRIA Lille Nord-Eur., Villeneuve-d´´Ascq, France

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

2200

Lastpage :

2203

Abstract :

The problem of acoustic-to-articulatory speech inversion continues to be a challenging research problem which significantly impacts automatic speech recognition robustness and accuracy. This paper presents a multi-task kernel based method aimed at learning Vocal Tract (VT) variables from the Mel-Frequency Cepstral Coefficients (MFCCs). Unlike usual speech inversion techniques based on individual estimation of each tract variable, the key idea here is to consider all the target variables simultaneously to take advantage of the relationships among them and then improve learning performance. The proposed method is evaluated using synthetic speech dataset and corresponding tract variables created by the TAsk Dynamics Application (TADA) model and compared to the hierarchical ε-SVR speech inversion technique.

Keywords :

cepstral analysis; speech recognition; ε-SVR speech inversion technique; MFCC; TADA model; acoustic-to-articulatory speech inversion; automatic speech recognition accuracy; automatic speech recognition robustness; learning VT variable; learning vocal tract variable; melfrequency cepstral coefficient; multitask kernel based method; synthetic speech dataset; task dynamics application model; Hilbert space; Kernel; Machine learning; Mel frequency cepstral coefficient; Speech; Speech recognition; Multi-task learning; acoustic-to-articulatory inversion; matrix-valued kernel; vocal tract variables;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5946917

Filename :

5946917

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2162878