مرکز منطقه ای اطلاع رساني علوم و فناوري - Design of a linguistic statistical decoder for the recognition of continuous speech

DocumentCode :

923353

Title :

Design of a linguistic statistical decoder for the recognition of continuous speech

Author :

Jelinek, Frederick ; Bahl, Lalit R. ; Mercer, Robert L.

Volume :

Issue :

fYear :

1975

fDate :

5/1/1975 12:00:00 AM

Firstpage :

250

Lastpage :

256

Abstract :

Most current attempts at automatic speech recognition are formulated in an artificial intelligence framework. In this paper we approach the problem from an information-theoretic point of view. We describe the overall structure of a linguistic statistical decoder (LSD) for the recognition of continuous speech. The input to the decoder is a string of phonetic symbols estimated by an acoustic processor (AP). For each phonetic string, the decoder finds the most likely input sentence. The decoder consists of four major subparts: 1) a statistical model of the language being recognized; 2) a phonemic dictionary and statistical phonological rules characterizing the speaker; 3) a phonetic matching algorithm that computes the similarity between phonetic strings, using the performance characteristics of the AP; 4) a word level search control. The details of each of the subparts and their interaction during the decoding process are discussed.

Keywords :

Decoding; Speech recognition; Artificial intelligence; Automatic control; Automatic speech recognition; Character recognition; Decoding; Dictionaries; Loudspeakers; Natural languages; Speech recognition; Vocabulary;

fLanguage :

English

Journal_Title :

Information Theory, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9448

Type :

jour

DOI :

10.1109/TIT.1975.1055384

Filename :

1055384

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=923353