مرکز منطقه ای اطلاع رساني علوم و فناوري - Construction of weighted finite state transducers for very wide context-dependent acoustic models

DocumentCode :

2875141

Title :

Construction of weighted finite state transducers for very wide context-dependent acoustic models

Author :

Schuster, Mike ; Hori, Takaaki

Author_Institution :

NTT Commun. Sci. Lab., NTT Corp., Kyoto

fYear :

2005

fDate :

27-27 Nov. 2005

Firstpage :

162

Lastpage :

167

Abstract :

A previous paper by the authors described an algorithm for efficient construction of weighted finite state transducers for speech recognition when high-order context-dependent models of order K > 3 (triphones) with tied state observation distributions are used, and showed practical application of the algorithm up to K = 5 (quinphones). In this paper we give additional details of the improved implementation and analyze the algorithm´s practical runtime requirements and memory footprint for context-orders up to K = 13 (+/-6 phones context) when building fully cross-word capable WFSTs for large vocabulary speech recognition tasks. We show that for typical systems it is possible to use any practical context-order K les 13 without having to fear an exponential explosion of the search space, since the necessary state ID to phone transducer (resembling a phone-loop observing all possible K-phone constraints) can be built in a few minutes at most. The paper also gives some implementation details of how we efficiently collect context statistics and build phonetic decision trees for very wide context-dependent acoustic models

Keywords :

acoustic transducers; decision trees; finite state machines; speech recognition; speech synthesis; finite state transducers; memory footprint; phonetic decision trees; state observation distributions; vocabulary speech recognition; wide context-dependent acoustic models; Acoustic transducers; Algorithm design and analysis; Context modeling; Decision trees; Explosions; Runtime; Speech analysis; Speech recognition; Statistics; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on

Conference_Location :

San Juan

Print_ISBN :

0-7803-9478-X

Electronic_ISBN :

0-7803-9479-8

Type :

conf

DOI :

10.1109/ASRU.2005.1566482

Filename :

1566482

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2875141