DocumentCode
3253987
Title
Asynchronous translations with recurrent neural nets
Author
Neco, Ramón P. ; Forcada, Mikel L.
Author_Institution
Dept. de Llenguatges i Sistemes Inf., Alacanti Univ., Spain
Volume
4
fYear
1997
fDate
9-12 Jun 1997
Firstpage
2535
Abstract
Many researchers have explored the relation between discrete-time recurrent neural networks (DTRNN) and finite-state machines (FSMs) either by showing their computational equivalence or by training them to perform as finite-state recognizers from examples. Most of this work has focused on the simplest class of deterministic state machines, that is deterministic finite automata and Mealy (or Moore) machines. The class of translations these machines can perform is very limited, mainly because these machines output symbols at the same rate as they input symbols, and therefore, the input and the translation have the same length; one may call these translations synchronous. Real-life translations are more complex: word reorderings, deletions, and insertions are common in natural-language translations; or, in speech-to-phoneme conversion, the number of frames corresponding to each phoneme is different and depends on the particular speaker or word. There are, however, simple deterministic, finite-state machines (extensions of Mealy machines) that may perform these classes of “asynchronous” or “time-warped” translations. A simple DTRNN model with input and output control lines inspired on this class of machines is presented and successfully applied to simple asynchronous translation tasks with interesting results regarding generalization. Training of these nets from input-output pairs is complicated by the fact that the time alignment between the target output sequence and the input sequence is unknown and has to be learned: we propose a new error function to tackle this problem. This approach to the induction of asynchronous translators is discussed in connection with other approaches
Keywords
finite state machines; generalisation (artificial intelligence); learning (artificial intelligence); multilayer perceptrons; recurrent neural nets; Mealy machines; Moore machines; asynchronous translations; computational equivalence; deterministic finite automata; deterministic finite-state machines; discrete-time recurrent neural networks; finite-state machines; natural-language translations; recurrent neural nets; speech-to-phoneme conversion; time-warped translations; word reorderings; Automata; Computer networks; Hafnium; Inference algorithms; Informatics; Recurrent neural networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks,1997., International Conference on
Conference_Location
Houston, TX
Print_ISBN
0-7803-4122-8
Type
conf
DOI
10.1109/ICNN.1997.614693
Filename
614693
Link To Document