Acoustic-to-phonetic mapping using recurrent neural networks

Author

Hanes, Mark D. ; Ahalt, Stanley C. ; Krishnamurthy, Ashok K.

Author_Institution

Dept. of Electr. Eng., Ohio State Univ., Columbus, OH, USA

Volume

5

Issue

4

fYear

1994

fDate

7/1/1994

Firstpage

659

Lastpage

662

Abstract

This paper describes the application of artificial neural networks to acoustic-to-phonetic mapping. The experiments described are typical of problems in speech recognition in which the temporal nature of the input sequence is critical. The specific task considered is that of mapping formant contours to the corresponding CVC´ syllable. We performed experiments on formant data extracted from the acoustic speech signal spoken at two different tempos (slow and normal) using networks based on the Elman simple recurrent network model. Our results show that the Elman networks used in these experiments were successful in performing the acoustic-to-phonetic mapping from formant contours. Consequently, we demonstrate that relatively simple networks, readily trained using standard backpropagation techniques, are capable of initial and final consonant discrimination and vowel identification for variable speech rates.
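
As an illustration of the Elman simple recurrent network model named in the abstract, the following is a minimal sketch of a forward pass only, not the authors' implementation. Everything specific in it is an assumption made for illustration: three formant values per frame, eight hidden units, four output classes, the tanh and softmax nonlinearities, the weight initialisation, and the names `ElmanNetwork`, `make_matrix`, `matvec` and `softmax`. Training by backpropagation is omitted.

```python
import math
import random


def make_matrix(rows, cols, rng, scale=0.1):
    """Return a rows x cols matrix of small random weights (hypothetical init)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(values):
    """Normalise raw scores into values that sum to 1."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


class ElmanNetwork:
    """Simple recurrent network: the hidden layer also receives a copy of
    its own previous activations (the context units)."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        self.w_in = make_matrix(n_hidden, n_in, rng)       # input -> hidden
        self.w_ctx = make_matrix(n_hidden, n_hidden, rng)  # context -> hidden
        self.w_out = make_matrix(n_out, n_hidden, rng)     # hidden -> output

    def forward(self, frames):
        """Process a sequence of frames; return one output vector per frame."""
        context = [0.0] * self.n_hidden  # context units start at zero
        outputs = []
        for frame in frames:
            pre = [a + b for a, b in zip(matvec(self.w_in, frame),
                                         matvec(self.w_ctx, context))]
            hidden = [math.tanh(p) for p in pre]
            outputs.append(softmax(matvec(self.w_out, hidden)))
            context = hidden  # copy hidden state for the next time step
        return outputs


# Hypothetical formant contour: three frames of (F1, F2, F3) in kHz.
frames = [[0.30, 2.20, 3.00], [0.45, 1.90, 2.80], [0.70, 1.20, 2.50]]
net = ElmanNetwork(n_in=3, n_hidden=8, n_out=4)
scores = net.forward(frames)

# Feeding the same frame twice shows the effect of the context units.
repeated = net.forward([frames[0], frames[0]])
```

Because the context units carry the previous hidden state forward, the two outputs in `repeated` differ even though the input frame is identical, which is the property that lets such a network respond to the temporal shape of a contour rather than to isolated frames.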

Keywords

backpropagation; recurrent neural nets; speech recognition; Elman networks; acoustic speech signal; acoustic-to-phonetic mapping; consonant discrimination; formant contours mapping; recurrent neural networks; vowel identification; Artificial neural networks; Computer architecture; Computer networks; Data mining; Delay effects; Network synthesis; Neural networks; Speech synthesis

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

IEEE

ISSN

1045-9227

Type

jour

DOI

10.1109/72.298235

Filename

298235