مرکز منطقه ای اطلاع رساني علوم و فناوري - Recent developments in large vocabulary continuous speech recognition

DocumentCode :

590703

Title :

Recent developments in large vocabulary continuous speech recognition

Author :

Saon, George ; Jen-Tzung Chien

Author_Institution :

IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA

fYear :

2012

fDate :

3-6 Dec. 2012

Firstpage :

Lastpage :

Abstract :

This paper overviews a series of recent approaches to front-end processing, acoustic modeling, language modeling, and back-end search and system combination which have made contributions for large vocabulary continuous speech recognition (LVCSR) systems. These approaches include the feature transformations, speaker-adaptive features, and discriminative features in front-end processing, the feature-space and model-space discriminative training, deep neural networks, and speaker adaptation in acoustic modeling, the backoff smoothing, large-span modeling, and model regularization in language modeling, and the system combination, cross-adaptation, and boosting in search and system combination. Some future directions for LVCSR research are also addressed.

Keywords :

feature extraction; neural nets; speech recognition; LVCSR; acoustic modeling; back-end search; discriminative features; feature transformations; front-end processing; language modeling; large vocabulary continuous speech recognition; model regularization; neural networks; speaker adaptation; speaker-adaptive features; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech; Training; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific

Conference_Location :

Hollywood, CA

Print_ISBN :

978-1-4673-4863-8

Type :

conf

Filename :

6411850

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=590703