مرکز منطقه ای اطلاع رساني علوم و فناوري - Classification and recognition with direct segment models

DocumentCode :

3162243

Title :

Classification and recognition with direct segment models

Author :

Zweig, Geoffrey

Author_Institution :

Microsoft Res., Redmond, WA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4161

Lastpage :

4164

Abstract :

Segment based direct models have recently been used to improve the output of existing state-of-the-art speech recognizers. To date, however, they have relied on an existing HMM system to provide segment boundaries. This paper takes initial steps at using these models on their own, first by developing a segment-based maximum entropy phone classifier, and then by utilizing the features in a segmental conditional random field for recognition. To produce a feature representation that is independent of segment length, we utilize a set of ngram features based on vector-quantized representations of the acoustic input. We find that the models are able to integrate information at different granularities and from different streams. Contextual information from around the segment boundaries is particularly important. We obtain competitive results for TIMIT phone classification, and present initial recognition results.

Keywords :

entropy; feature extraction; hidden Markov models; speech recognition; vector quantisation; TIMIT phone classification; contextual information; direct segment models; existing HMM system; feature representation; phone classifier; segment boundaries; segment-based maximum entropy; segmental conditional random recognition field; state-of-the-art speech recognizers; vector-quantized representations; Acoustics; Context; Entropy; Error analysis; Hidden Markov models; Speech; Speech recognition; Maximum Entropy; Segmental Conditional Random Fields; Speech Recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288835

Filename :

6288835

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3162243