مرکز منطقه ای اطلاع رساني علوم و فناوري - A phonetic feature based lattice rescoring approach to LVCSR

DocumentCode :

3527373

Title :

A phonetic feature based lattice rescoring approach to LVCSR

Author :

Siniscalchi, Sabato Marco ; Svendsen, Torbjórn ; Lee, Chin-Hui

Author_Institution :

Dept. of Electron. & Telecommun., Norwegian Univ. of Sci. & Technol., Trondheim

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

3865

Lastpage :

3868

Abstract :

Large Vocabulary Continuous Speech Recognition (LVCSR) systems decode the input speech using diverse information sources, such as acoustic, lexical, and linguistic. Although most of the unreliable hypotheses are pruned during the recognition process, current state-of-the-art systems often make errors that are ldquounreasonablerdquo for human listeners. Several studies have shown that a proper integration of acoustic-phonetic information can be beneficial to reducing such errors. We have previously shown that high-accuracy phone recognition can be achieved if a bank of speech attribute detectors is used to compute a confidence score describing attribute activation levels that the current frame exhibits. In those experiments, the phone recognition system did not rely on the language model to follow their word sequence constraints, and the vocabulary was small. In this work, we extend our approach to LVCSR by introducing a second recognition step during which additional information not directly used during conventional log-likelihood based decoding is introduced. Experimental results show promising performance.

Keywords :

decoding; speech recognition; large vocabulary continuous speech recognition systems; log likelihood decoding; phone recognition; phonetic feature lattice rescoring; speech attribute detector; Acoustic signal detection; Artificial neural networks; Automatic speech recognition; Decoding; Detectors; Hidden Markov models; Humans; Lattices; Speech recognition; Vocabulary; Detectors; neural networks; speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4960471

Filename :

4960471

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3527373