DocumentCode :
840962
Title :
The SIGMA Algorithm: A Glottal Activity Detector for Electroglottographic Signals
Author :
Thomas, Mark R P ; Naylor, Patrick A.
Author_Institution :
Dept. of Electr. & Electron. Eng., Imperial Coll. London, London, UK
Volume :
17
Issue :
8
fYear :
2009
Firstpage :
1557
Lastpage :
1566
Abstract :
Accurate estimation of glottal closure instants (GCIs) and opening instants (GOIs) is important for speech processing applications that benefit from glottal-synchronous processing. The majority of existing approaches detect GCIs by comparing the differentiated EGG signal to a threshold and are able to provide accurate results during voiced speech. More recent algorithms use a similar approach across multiple dyadic scales using the stationary wavelet transform. All existing approaches are however prone to errors around the transition regions at the end of voiced segments of speech. This paper describes a new method for EGG-based glottal activity detection which exhibits high accuracy over the entirety of voiced segments, including, in particular, the transition regions, thereby giving significant improvement over existing methods. Following a stationary wavelet transform-based preprocessor, detection of excitation due to glottal closure is performed using a group delay function and then true and false detections are discriminated by Gaussian mixture modeling. GOI detection involves additional processing using the estimated GCIs. The main purpose of our algorithm is to provide a ground-truth for GCIs and GOIs. This is essential in order to evaluate algorithms that estimate GCIs and GOIs from the speech signal only, and is also of high value in the analysis of pathological speech where knowledge of GCIs and GOIs is often needed. We compare our algorithm with two previous algorithms against a hand-labeled database. Evaluation has shown an average GCI hit rate of 99.47% and GOI of 99.35%, compared to 96.08 and 92.54 for the best-performing existing algorithm.
Keywords :
Gaussian processes; medical signal processing; speech processing; wavelet transforms; EGG signal; Gaussian mixture modeling; SIGMA algorithm; electroglottographic signal; glottal activity detector; glottal closure instant estimation; glottal opening instant estimation; glottal-synchronous processing; group delay function; hand-labeled database; multiscale analysis algorithm; pathological speech analysis; speech processing application; stationary wavelet transform; Electroglottograph (EGG); glottal closure instants (GCIs); group delay function; laryngograph;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2009.2022430
Filename :
4912310
Link To Document :
بازگشت