DocumentCode
1080295
Title
Investigation of an efficient representation of speech spectra for segmentation and classification of speech sounds
Author
Beninghof, W. ; Ross, Myron J.
Author_Institution
Northeastern University, Boston, Mass
Volume
18
Issue
1
fYear
1970
fDate
3/1/1970 12:00:00 AM
Firstpage
33
Lastpage
42
Abstract
A functional representation of speech sounds in orthogonal polynomial space is described and preliminary results are presented. Speech spectra are approximated by a linear combination of orthogonal polynomials which are found to be more efficient than a linear combination of trigonometric functions. The original spectra (100 samples in frequency) and the polynomial approximations are represented by points in their respective Hilbert spaces, the distance between successive points being a measure of the dissimilarity of successive spectra. Segment boundaries are indicated where the distance between successive spectra exceeds a threshold. The effectiveness in segmentation of connected utterances using these spectral forms is compared. Also, representing speech in orthogonal polynomial space appears to be applicable to clustering and separating transformations which yield simple decision boundaries for phoneme classification. Although only one polynomial class is investigated, the procedure is valid for other functional representations of speech data.
Keywords
Acoustical engineering; Hilbert space; Linear approximation; NASA; Polynomials; Speech; Winches;
fLanguage
English
Journal_Title
Audio and Electroacoustics, IEEE Transactions on
Publisher
ieee
ISSN
0018-9278
Type
jour
DOI
10.1109/TAU.1970.1162077
Filename
1162077
Link To Document