Regional Information Center for Science and Technology (RICEST) - Speech recognition with character string encoding

DocumentCode :

2999558

Title :

Speech recognition with character string encoding

Author :

White, G.M.

Author_Institution :

Xerox Palo Alto Research Center, Palo Alto, California

fYear :

1972

fDate :

13-15 Dec. 1972

Firstpage :

111

Lastpage :

113

Abstract :

An isolated word recognition system based on character string encoding is described. It has achieved 98% correct recognition scores on limited vocabularies (20-54 words). Speaker normalization, word segmentation, and learning paradigms have been incorporated. Audio input passes through a 6-channel octave band-pass filter bank. The output of each channel is time-integrated for 10 ms and log-mapped. An utterance is represented by a succession of points (a new point is generated every 10 ms) in the 6-dimensional space defined by the 6 octave bands. Reference points are scattered throughout the space. Each time interval is assigned the label of the nearest reference point. We call the resulting string of labels a "character string". Encoding an utterance into a character string may proceed with an arbitrary degree of precision, with greater resolution resulting from the use of more reference points. Only 24 reference points are needed to achieve 98% correct recognition scores for 54 words in near real time. String generation techniques are explored. Several learning schemes based on character strings are described. Finally, experiments with a software classifier that uses "deformable templates" based on character strings are presented.
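
The encoding step described in the abstract (log-map each 10 ms frame, then label it with the nearest reference point) might be sketched as follows. This is an illustrative sketch only, not the paper's implementation: the function names `log_frame` and `encode_utterance`, the Euclidean distance metric, the energy floor, and the use of uppercase letters as labels are all assumptions, since the abstract specifies none of them.

```python
import math
import string


def log_frame(energies, floor=1e-10):
    """Log-map one frame of time-integrated channel energies.

    The floor (an assumption, not from the abstract) avoids log(0)
    on silent channels.
    """
    return tuple(math.log(max(e, floor)) for e in energies)


def encode_utterance(frames, reference_points, labels=string.ascii_uppercase):
    """Return the character string for a sequence of frames.

    Each frame is a point in the filter-bank space (6-dimensional in the
    abstract); it receives the label of its nearest reference point.
    """
    if len(reference_points) > len(labels):
        raise ValueError("not enough labels for the reference points")
    chars = []
    for frame in frames:
        # Euclidean distance is an assumption; the abstract only says "nearest".
        nearest = min(
            range(len(reference_points)),
            key=lambda i: math.dist(frame, reference_points[i]),
        )
        chars.append(labels[nearest])
    return "".join(chars)


# Toy data: three hypothetical reference points and four frames in 6-D space.
refs = [(0.0,) * 6, (1.0,) * 6, (2.0,) * 6]
frames = [(0.1,) * 6, (0.9,) * 6, (1.2,) * 6, (2.4,) * 6]
print(encode_utterance(frames, refs))  # prints "ABBC"
```

With 24 reference points, as in the abstract, each 10 ms frame would map to one of 24 labels, so a one-second utterance would yield a string of about 100 characters.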

Keywords :

Band pass filters; Character recognition; Encoding; Ferroelectric films; Filtering; Nonvolatile memory; Random access memory; Scattering; Speech recognition; Vocabulary;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Decision and Control, 1972 and 11th Symposium on Adaptive Processes. Proceedings of the 1972 IEEE Conference on

Conference_Location :

New Orleans, Louisiana, USA

Type :

conf

DOI :

10.1109/CDC.1972.268956

Filename :

4044879

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2999558