Title :
Investigation of an efficient representation of speech spectra for segmentation and classification of speech sounds
Author :
Beninghof, W. ; Ross, Myron J.
Author_Institution :
Northeastern University, Boston, Mass
fDate :
3/1/1970 12:00:00 AM
Abstract :
A functional representation of speech sounds in orthogonal polynomial space is described and preliminary results are presented. Speech spectra are approximated by a linear combination of orthogonal polynomials which are found to be more efficient than a linear combination of trigonometric functions. The original spectra (100 samples in frequency) and the polynomial approximations are represented by points in their respective Hilbert spaces, the distance between successive points being a measure of the dissimilarity of successive spectra. Segment boundaries are indicated where the distance between successive spectra exceeds a threshold. The effectiveness in segmentation of connected utterances using these spectral forms is compared. Also, representing speech in orthogonal polynomial space appears to be applicable to clustering and separating transformations which yield simple decision boundaries for phoneme classification. Although only one polynomial class is investigated, the procedure is valid for other functional representations of speech data.
Keywords :
Acoustical engineering; Hilbert space; Linear approximation; NASA; Polynomials; Speech; Winches;
Journal_Title :
Audio and Electroacoustics, IEEE Transactions on
DOI :
10.1109/TAU.1970.1162077