DocumentCode :
642520
Title :
Statistics based features for unvoiced sound classification
Author :
Sivasankaran, Shiju ; Prabhu, K.M.M.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
fYear :
2013
fDate :
22-25 Sept. 2013
Firstpage :
1
Lastpage :
6
Abstract :
Unvoiced phonemes have significant presence in spoken English language. These phonemes are hard to classify, due to their weak energy and lack of periodicity. Sound textures such as sound made by a flowing stream of water or falling droplets of rain have similar aperiodic properties in temporal domain as unvoiced phonemes. These sounds are easily differentiated by a human ear. Recent studies on sound texture analysis and synthesis have shown that the human auditory system perceives sound textures using simple statistics. These statistics are obtained by decomposing sounds using a set of filter-banks and computing the moments of the filter responses, along with their correlation values. In this work we investigate if the above mentioned statistics, which are easy to extract, can also be used as features for classifying unvoiced sounds. To incorporate the moments and correlation values as features, a framework containing multiple classifiers is proposed. Experiments conducted on the TIMIT dataset gave an accuracy on par with the latest reported in the literature, with lesser computational cost.
Keywords :
channel bank filters; correlation methods; natural language processing; signal classification; speech processing; statistical analysis; TIMIT dataset; aperiodic properties; correlation values; filter responses; filter-banks; human auditory system; human ear; sound decomposition; sound texture analysis; sound texture synthesis; spoken English language; statistics based features; temporal domain; unvoiced phonemes; unvoiced sound classification; Accuracy; Auditory system; Correlation; Ear; Feature extraction; Modulation; Noise; Gaussian Mixture Model (GMM); Linear Prediction Coefficients (LPC); Unvoiced phonemes; features; sound textures; statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2013 IEEE International Workshop on
Conference_Location :
Southampton
ISSN :
1551-2541
Type :
conf
DOI :
10.1109/MLSP.2013.6661986
Filename :
6661986
Link To Document :
بازگشت