• DocumentCode
    2296972
  • Title

    Neural network boundary refining for automatic speech segmentation

  • Author

    Toledano, Doroteo

  • Author_Institution
    Telefonica Investigacion y Desarrollo, Madrid, Spain
  • Volume
    6
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3438
  • Abstract
    This work is an extension of a previous work in which an automatic speech segmentation and labeling system was proposed based on a hidden Markov model (HMM) speech recognizer followed by a fuzzy-logic boundary correction system. In this paper we explore the possibility of substituting that difficult to design fuzzy-logic system by a neural network (NN) based system that can be automatically trained. First, the whole fuzzy-logic boundary correction system, which used different rule sets for each kind of phonetic transition, has been substituted by a single NN. Results show that this single NN outperforms the complete fuzzy-logic system. Then, the possibility of using different NNs specialized in each kind of phonetic transition has been explored. Results are again clearly better than the results obtained with the fuzzy-logic system, but not clearly better than the results obtained with just one NN
  • Keywords
    hidden Markov models; learning (artificial intelligence); multilayer perceptrons; speech processing; speech recognition; HMM speech recognizer; MLP; automatic speech labeling system; automatic speech segmentation; fuzzy-logic boundary correction system; fuzzy-logic system; hidden Markov model; multilayer perceptron; neural network boundary refining; phonetic transition; training; Automatic speech recognition; Databases; Electronic mail; Hidden Markov models; Humans; Labeling; Natural languages; Neural networks; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.860140
  • Filename
    860140