DocumentCode
1123061
Title
Acoustic-to-phonetic mapping using recurrent neural networks
Author
Hanes, Mark D. ; Ahalt, Stanley C. ; Krishnamurthy, Ashok K.
Author_Institution
Dept. of Electr. Eng., Ohio State Univ., Columbus, OH, USA
Volume
5
Issue
4
fYear
1994
fDate
7/1/1994 12:00:00 AM
Firstpage
659
Lastpage
662
Abstract
This paper describes the application of artificial neural networks to acoustic-to-phonetic mapping. The experiments described are typical of problems in speech recognition in which the temporal nature of the input sequence is critical. The specific task considered is that of mapping formant contours to the corresponding CVC´ syllable. We performed experiments on formant data extracted from the acoustic speech signal spoken at two different tempos (slow and normal) using networks based on the Elman simple recurrent network model. Our results show that the Elman networks used in these experiments were successful in performing the acoustic-to-phonetic mapping from formant contours. Consequently, we demonstrate that relatively simple networks, readily trained using standard backpropagation techniques, are capable of initial and final consonant discrimination and vowel identification for variable speech rates.
Keywords
backpropagation; recurrent neural nets; speech recognition; Elman networks; acoustic speech signal; acoustic-to-phonetic mapping; consonant discrimination; formant contours mapping; recurrent neural networks; vowel identification; Artificial neural networks; Computer architecture; Computer networks; Data mining; Delay effects; Network synthesis; Neural networks; Speech synthesis
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
IEEE
ISSN
1045-9227
Type
jour
DOI
10.1109/72.298235
Filename
298235