مرکز منطقه ای اطلاع رساني علوم و فناوري - Coping with out-of-vocabulary words: Open versus huge vocabulary asr

DocumentCode :

3529191

Title :

Coping with out-of-vocabulary words: Open versus huge vocabulary asr

Author :

Gerosa, Matteo ; Federico, Marcello

Author_Institution :

FBK-irst - Fondazione Bruno Kessler, Povo

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

4313

Lastpage :

4316

Abstract :

This paper investigates methods for coping with out-of-vocabulary words in a large vocabulary speech recognition task, namely the automatic transcription of Italian broadcast news. Two alternative ways for augmenting a 64 K(thousand)-word recognition vocabulary and language model are compared: introducing extra words with their phonetic transcription up to 1.2 M (million) words, or extending the language model with so-called graphones, i.e. subword units made of phone-character sequences. Graphones and phonetic transcriptions of words are automatically generated by adapting an off-the-shelf statistical machine translation toolkit. We found that the word-based and graphone based extentions allow both for better recognition performance, with the former performing significantly better than the latter. In addition, the word-based extension approach shows interesting potential even under conditions of little supervision. In fact, by training the grapheme to phoneme translation system with only 2 K manually verified transcriptions, the final word error rate increases by just 3% relative, with respect to starting from a lexicon of 64 K words.

Keywords :

language translation; speech processing; speech recognition; statistical analysis; Italian broadcast news; automatic transcription; language model; out-of-vocabulary word; phoneme translation system; phonetic transcription; statistical machine translation toolkit; vocabulary speech recognition; word error rate; word recognition; Art; Automatic speech recognition; Broadcasting; Documentation; Error analysis; Natural languages; Robustness; Speech recognition; Training data; Vocabulary; Automatic Speech Recognition; OOV words; Open-vocabulary speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4960583

Filename :

4960583

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3529191