• DocumentCode
    1080295
  • Title
    Investigation of an efficient representation of speech spectra for segmentation and classification of speech sounds
  • Author
    Beninghof, W.; Ross, Myron J.
  • Author_Institution
    Northeastern University, Boston, Mass
  • Volume
    18
  • Issue
    1
  • fYear
    1970
  • fDate
    3/1/1970
  • Firstpage
    33
  • Lastpage
    42
  • Abstract
    A functional representation of speech sounds in orthogonal polynomial space is described and preliminary results are presented. Speech spectra are approximated by a linear combination of orthogonal polynomials which are found to be more efficient than a linear combination of trigonometric functions. The original spectra (100 samples in frequency) and the polynomial approximations are represented by points in their respective Hilbert spaces, the distance between successive points being a measure of the dissimilarity of successive spectra. Segment boundaries are indicated where the distance between successive spectra exceeds a threshold. The effectiveness in segmentation of connected utterances using these spectral forms is compared. Also, representing speech in orthogonal polynomial space appears to be applicable to clustering and separating transformations which yield simple decision boundaries for phoneme classification. Although only one polynomial class is investigated, the procedure is valid for other functional representations of speech data.
  • Keywords
    Acoustical engineering; Hilbert space; Linear approximation; NASA; Polynomials; Speech; Winches;
  • fLanguage
    English
  • Journal_Title
    Audio and Electroacoustics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9278
  • Type
    jour
  • DOI
    10.1109/TAU.1970.1162077
  • Filename
    1162077
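
The segmentation idea in the abstract (approximate each spectrum by orthogonal polynomials, treat each approximation as a point, and mark a boundary where successive points are far apart) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' procedure: the abstract does not name the polynomial class, so Legendre polynomials are assumed here, and the function names, `degree`, and `threshold` are hypothetical.

```python
import numpy as np
from numpy.polynomial import legendre


def fit_spectrum(spectrum, degree):
    """Least-squares Legendre coefficients for one spectrum.

    Assumption: the frequency axis is mapped linearly onto [-1, 1],
    the interval on which Legendre polynomials are orthogonal.
    """
    x = np.linspace(-1.0, 1.0, len(spectrum))
    return legendre.legfit(x, spectrum, degree)


def segment_boundaries(frames, degree, threshold):
    """Return (boundaries, distances) for a sequence of spectra.

    A boundary index i means frame i differs from frame i - 1 by more
    than `threshold` in the (assumed) polynomial space.
    """
    coeffs = np.array([fit_spectrum(f, degree) for f in frames])
    # The squared L2 norm of the n-th Legendre polynomial on [-1, 1] is
    # 2 / (2n + 1). Scaling by its square root makes the Euclidean
    # distance between coefficient vectors equal the L2 distance
    # between the fitted curves.
    n = np.arange(degree + 1)
    points = coeffs * np.sqrt(2.0 / (2.0 * n + 1.0))
    distances = np.linalg.norm(np.diff(points, axis=0), axis=1)
    boundaries = [i + 1 for i, d in enumerate(distances) if d > threshold]
    return boundaries, distances


# Synthetic example: five frames of one spectral shape, then five of another,
# each with 100 frequency samples as in the abstract.
x = np.linspace(-1.0, 1.0, 100)
frames = [x.copy() for _ in range(5)] + [x**2 for _ in range(5)]
boundaries, distances = segment_boundaries(frames, degree=4, threshold=0.1)
```

With this synthetic input the only large distance falls between the fifth and sixth frames, so a single boundary is reported there. On real speech spectra the choice of polynomial degree and threshold would need to be tuned; the values above are placeholders.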