Title :
Scale-space expansion of acoustic features improves speech event detection
Author :
Ryant, Neville ; Jiahong Yuan ; Liberman, Mark
Author_Institution :
Linguistic Data Consortium, Univ. of Pennsylvania, Philadelphia, PA, USA
Abstract :
In a system for detecting and measuring phonetic events (here bursts, voice onsets, and voice-onset times), we show that the addition of features smoothed at multiple scales can improve both recall (the proportion of events correctly identified) and measurement accuracy (the timing of events and the difference between event times, relative to expert human judgments). Multi-scale (or “scale space”) features had an especially strong positive effect on robustness across datasets with different materials and recording conditions. Standard machine-learning classifiers were able to integrate information across scales, without any special treatment of the multi-scale features.
Keywords :
feature extraction; learning (artificial intelligence); signal classification; signal detection; speech recognition; acoustic features; event times; expert human judgments; here bursts; machine learning classifiers; measurement accuracy; multiscale features; phonetic events; scale space features; scale-space expansion; speech event detection; voice onsets; voice-onset times; Acoustic measurements; Acoustics; Detectors; Feature extraction; Image edge detection; Kernel; Speech; automated phonetic measurement; scale space; voice onset time;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6638956