• DocumentCode
    302304
  • Title

    A fast stochastic parser for determining phrase boundaries for text-to-speech synthesis

  • Author

    Sharman, Richard A. ; Wright, Jerry H.

  • Author_Institution
    IBM UK Labs. Ltd., Winchester, UK
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    357
  • Abstract
    A stochastic parser is described which creates a phrase structure for a tagged sentence on the basis of statistical information inferred from a manually-bracketed training corpus. The information employed consists of measured probabilities for tag unigrams, symbol bigrams, bracket enclosures, bracket opening and closing, and length distribution. For experimental purposes a tree-search algorithm is used to find the highest-scoring bracketing, and a tree metric is used to measure the accuracy of the results for a test corpus. Finally, a fast algorithm for implementation is based on a finite-state approximation to the tree-search algorithm. Using these procedures, a gross level of syntactic structure is found quickly, with the main aim being that of pause insertion in real-time text-to-speech systems
  • Keywords
    finite state machines; natural languages; speech synthesis; statistical analysis; stochastic processes; tree searching; bracket closing; bracket enclosure; bracket opening; fast stochastic parser; finite-state approximation; length distribution; manually-bracketed training corpus; pause insertion; phrase boundaries; phrase structure; probabilities; statistical information; symbol bigrams; syntactic structure; tag unigrams; tagged sentence; text-to-speech synthesis; tree metric; tree-search algorithm; Approximation algorithms; Laboratories; Length measurement; Mathematics; Probability; Real time systems; Speech analysis; Speech synthesis; Stochastic processes; Winches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.541106
  • Filename
    541106