DocumentCode :
3411118
Title :
Segmentation of a speech spectrogram using mathematical morphology
Author :
Steinberg, Raphael ; O'Shaughnessy, Douglas
Author_Institution :
INRS-Telecommun., Montreal, QC
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
1637
Lastpage :
1640
Abstract :
It has been shown that trained experts can read speech spectrograms. In this work, we treat the speech spectrogram image as written text in an unknown language and segment it to capture the energy associated with each formant. We propose an algorithm based on mathematical morphology operators, chiefly the watershed transform. The result is a robust segmentation of wideband speech spectrograms that can later be used for automatic speech recognition. We show experimental results for different phoneme classes.
Keywords :
image segmentation; mathematical morphology; speech processing; speech recognition; automatic speech recognition; mathematical morphology operator; phoneme class; speech spectrogram image segmentation; watershed transform; wideband speech spectrogram; Data mining; Frequency; Morphology; Optical filters; Skeleton; Spectrogram; Wideband; Morphological operations; Optical character recognition
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517940
Filename :
4517940