مرکز منطقه ای اطلاع رساني علوم و فناوري - A comparison of waveform fractal dimension techniques for voice pathology classification

DocumentCode :

3163821

Title :

A comparison of waveform fractal dimension techniques for voice pathology classification

Author :

Baljekar, Pallavi N. ; Patil, Hemant A.

Author_Institution :

Dept. of Electron. & Commun., Manipal Inst. of Technol. (MIT), Manipal, India

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4461

Lastpage :

4464

Abstract :

In this paper, an attempt is made to compare and analyze the various waveform fractal dimension techniques for voice pathology classification. Three methods of estimating the fractal dimension directly from the time-domain waveform have been compared. The methods used are Katz algorithm, Higuchi algorithm and the Hurst exponent calculated using the rescaled range (R/S) analysis. Furthermore, the effects of the window size, the base waveform used and score-level fusion with Mel frequency cepstral coefficients (MFCC) has also been evaluated. The features have been extracted from two different base waveforms, the speech signal and the Teager energy operator (TEO) phase of the speech signal. Experiments have been carried out on a subset of the Massachusetts Eye and Ear Infirmary (MEEI) database and classifier used is a 2^nd order polynomial classifier. A classification accuracy of 97.54 %was achieved on score-level fusion, an increase in performance by about 2 % as compared to MFCC alone.

Keywords :

cepstral analysis; feature extraction; medical signal processing; polynomials; sensor fusion; signal classification; speech processing; time-domain analysis; waveform analysis; Higuchi algorithm; Hurst exponent; Katz algorithm; MEEI database; MFCC; Massachusetts eye and ear infirmary database; Mel frequency cepstral coefficients; TEO phase; Teager energy operator; classification accuracy; feature extraction; fractal dimension estimating methods; rescaled range analysis; score-level fusion; second order polynomial classifier; speech signal; time-domain waveform; voice pathology classification; waveform fractal dimension techniques; window size effects; Fractal dimension; Higuchi algorithm; Hurst exponent; Polynomial classifier; Voice Pathology;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288910

Filename :

6288910

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3163821