Title :
Development of Japanese voice-activated word processor using isolated monosyllable recognition
Author :
Nitta, T. ; Murata, T. ; Tsuboi, H. ; Takeda, K. ; Kawada, T. ; Watanabe, S.
Author_Institution :
Toshiba Research and Development Center, Kawasaki, Japan
Abstract :
This paper describes a newly developed voice-activated word processor and a two-stage recognition method to achieve a precise recognition of isolated monosyllables. At the first stage, the recognizer segments a monosyllable into an initial consonantal part and a final part (i.e., the vowel region), and computes similarities between the input speech and orthonormal mode functions of each consonantal segment which is designed from multiple speakers using K-L expansion and adapted to a new speaker ( Adaptive Multiple Similarity Method). At the second stage, frame-by-frame similarity scores, extracted at the phoneme recognizer using Multiple Similarity Method, are applied to candidate monosyllables to make a final decision. The average monosyllable recognition accuracy with six speakers was about 95%.
Keywords :
Commercialization; Concatenated codes; Equations; Frequency; Natural languages; Pattern matching; Pattern recognition; Prototypes; Speech processing; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
DOI :
10.1109/ICASSP.1982.1171875