DocumentCode
2296972
Title
Neural network boundary refining for automatic speech segmentation
Author
Toledano, Doroteo
Author_Institution
Telefonica Investigacion y Desarrollo, Madrid, Spain
Volume
6
fYear
2000
fDate
2000
Firstpage
3438
Abstract
This work is an extension of a previous work in which an automatic speech segmentation and labeling system was proposed based on a hidden Markov model (HMM) speech recognizer followed by a fuzzy-logic boundary correction system. In this paper we explore the possibility of substituting that difficult to design fuzzy-logic system by a neural network (NN) based system that can be automatically trained. First, the whole fuzzy-logic boundary correction system, which used different rule sets for each kind of phonetic transition, has been substituted by a single NN. Results show that this single NN outperforms the complete fuzzy-logic system. Then, the possibility of using different NNs specialized in each kind of phonetic transition has been explored. Results are again clearly better than the results obtained with the fuzzy-logic system, but not clearly better than the results obtained with just one NN
Keywords
hidden Markov models; learning (artificial intelligence); multilayer perceptrons; speech processing; speech recognition; HMM speech recognizer; MLP; automatic speech labeling system; automatic speech segmentation; fuzzy-logic boundary correction system; fuzzy-logic system; hidden Markov model; multilayer perceptron; neural network boundary refining; phonetic transition; training; Automatic speech recognition; Databases; Electronic mail; Hidden Markov models; Humans; Labeling; Natural languages; Neural networks; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.860140
Filename
860140
Link To Document