• DocumentCode
    1920565
  • Title
    Automatic segmentation and labeling of speech
  • Author
    Ljolje, A. ; Riley, M.D.
  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    473
  • Abstract
    The authors investigate an automatic approach to two tasks: segmenting speech that is already labeled, and jointly labeling and segmenting speech when only its orthographic transcription is available. The technique builds on a phone recognition system that combines a trigram phonotactic model, gamma distribution phone duration models, and a spectral model with five different structures for phone models of varying contextual dependencies. The alignment of speech with a given phone sequence is performed as a very constrained phone recognition task in which the phonotactic model is based only on the given phone sequence. When only the orthographic transcription is provided, a classification-tree-based prediction of the most likely phone realizations is used as an input network for the phone recognizer. The maximum likelihood phone sequence is then treated as the true phone sequence, and its segment boundaries are compared with the reference boundaries.
  • Keywords
    speech analysis and processing; classification-tree-based prediction; gamma distribution phone duration models; input network; labeled speech segmentation; maximum likelihood phone sequence; orthographic transcription; phone recognition; reference boundaries; segment boundaries; spectral model; trigram phonotactic model; varying contextual dependencies; Classification tree analysis; Context modeling; Databases; Humans; Labeling; Performance evaluation; Speech analysis; Speech synthesis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.1991.150379
  • Filename
    150379