DocumentCode :
3028847
Title :
Speech segmentation using multifractal measures and amplification of signal features
Author :
Kinsner, Witold ; Grieder, Warren
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Manitoba, Winnipeg, MB
fYear :
2008
fDate :
14-16 Aug. 2008
Firstpage :
351
Lastpage :
357
Abstract :
This paper describes a fast multiscale time-domain technique for the analysis of natural speech waveforms in the presence of noise. The technique is based on the variance fractal dimension trajectory algorithm that is used not only to detect the external boundaries of an utterance, but also its internal pauses representing the unvoiced speech. The algorithm can also identify internal features of phonemes. The features can be amplified so that the speech utterances can be segmented into sentences, words and phonemes. This approach is superior to other energy-based boundary-detection techniques. These observations are based on extensive experimental results on speech utterances digitized at 44.1 kilosamples per second, with 16 bits in each sample.
Keywords :
fractals; speech processing; time-domain analysis; waveform analysis; multifractal measures; multiscale time-domain technique; natural speech waveforms; phonemes; signal feature amplification; speech segmentation; speech utterance; variance fractal dimension trajectory; Acoustic noise; Automatic speech recognition; Feature extraction; Fractals; Linear predictive coding; Natural languages; Pattern matching; Signal processing algorithms; Speech analysis; Speech recognition; Fractal measures; feature amplification; fractal feature extraction; fractal variance dimension trajectory; speech utterance segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cognitive Informatics, 2008. ICCI 2008. 7th IEEE International Conference on
Conference_Location :
Stanford, CA
Print_ISBN :
978-1-4244-2538-9
Type :
conf
DOI :
10.1109/COGINF.2008.4639188
Filename :
4639188
Link To Document :
بازگشت