مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust spoken language identification using large vocabulary speech recognition

DocumentCode :

310589

Title :

Robust spoken language identification using large vocabulary speech recognition

Author :

Hieronymus, James L. ; Kadambe, Shubha

Author_Institution :

Bell Labs., Murray Hill, NJ, USA

Volume :

fYear :

1997

fDate :

21-24 Apr 1997

Firstpage :

1111

Abstract :

A robust, task independent spoken language identification (LID) system which uses a large vocabulary continuous speech recognition (LVCSR) module for each language to choose the most likely language spoken is described. The acoustic analysis uses mean cepstral removal on mel scale cepstral coefficients to compensate for different input channels. The system has been trained on 5 languages: English, German, Japanese, Mandarin Chinese and Spanish using a subset of the Oregon Graduate Institute 11 language data base. The five language results show 88% correct recognition for 50 second utterances without using confidence measures and 98% correct with confidence measures without the robust front end. The recognition rate is 81% correct for 10 second utterances without confidence measures and 93% correct with confidence measures without the robust front end. Adding the robust front end improves the recognition rate approximately 3% on the short utterances and 1% for the long utterances. The best performance has been obtained for systems trained on phonetically hand labeled speech

Keywords :

cepstral analysis; natural languages; speech recognition; English; German; Japanese; Mandarin Chinese; Spanish; acoustic analysis; cepstral removal; confidence measures; continuous speech recognition; input channels; large vocabulary speech recognition; mel scale cepstral coefficients; performance; phonetically hand labeled speech; recognition rate; robust spoken language identification; task independent spoken language identification; utterances; Cepstral analysis; Computer networks; Databases; Natural languages; Robustness; Speech recognition; System testing; Telephony; Training data; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

Conference_Location :

Munich

ISSN :

1520-6149

Print_ISBN :

0-8186-7919-0

Type :

conf

DOI :

10.1109/ICASSP.1997.596136

Filename :

596136

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=310589