مرکز منطقه ای اطلاع رساني علوم و فناوري - Generation of Phonetic Units for Mixed-Language Speech Recognition Based on Acoustic and Contextual Analysis

DocumentCode :

1092944

Title :

Generation of Phonetic Units for Mixed-Language Speech Recognition Based on Acoustic and Contextual Analysis

Author :

Huang, Chien-Lin ; Wu, Chung-Hsien

Volume :

Issue :

fYear :

2007

Firstpage :

1225

Lastpage :

1233

Abstract :

This work presents a novel approach to generating phonetic units in order to recognize mixed-language or multilingual speech. Acoustic and contextual analysis is performed to characterize multilingual phonetic units for phone set creation. Acoustic likelihood is utilized for similarity estimation of phone models. The hyperspace analog to language (HAL) model is adopted for contextual modeling and contextual similarity estimation. A confusion matrix combining acoustic and contextual similarities between every two phonetic units is built for phonetic unit clustering. Multidimensional scaling (MDS) method is applied to the confusion matrix for reducing dimensionality. Experimental results indicate that the created phonetic set provides a compact and robust set that considers acoustic and contextual information for mixed-language or multilingual speech recognition.

Keywords :

Context modeling; Fusion power generation; Maximum likelihood estimation; Multidimensional systems; Natural languages; Performance analysis; Robustness; Speech analysis; Speech processing; Speech recognition; Mixed-language speech recognition; hyperspace analog to language; multidimensional scaling; phonetic unit;

fLanguage :

English

Journal_Title :

Computers, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9340

Type :

jour

DOI :

10.1109/TC.2007.1064

Filename :

4288089

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1092944