DocumentCode :
2409571
Title :
Comparative Study of Filter-Bank Mean-Energy Distance for Automated Segmentation of Speech Signals
Author :
Ananthakrishnan, G. ; Ranjani, H.G. ; Ramakrishnan, A.G.
Author_Institution :
Electr. Eng. Dept., Indian Inst. of Sci., Bangalore
fYear :
2007
fDate :
22-24 Feb. 2007
Firstpage :
6
Lastpage :
10
Abstract :
This paper describes a method of automated segmentation of speech assuming the signal is continuously time varying rather than the traditional short time stationary model. It has been shown that this representation gives comparable if not marginally better results than the other techniques for automated segmentation. A formulation of the ´Bach´ (music semitonal) frequency scale filter-bank is proposed. A comparative study has been made of the performances using Mel, Bark and Bach scale filter banks considering this model. The preliminary results show up to 80 % matches within 20 ms of the manually segmented data, without any information of the content of the text and without any language dependence. ´Bach´ filters are seen to marginally outperform the other filters
Keywords :
channel bank filters; speech processing; automated segmentation; filter-bank mean-energy distance; frequency scale filter-bank; speech signal; Band pass filters; Filter bank; Frequency; Hidden Markov models; Information filtering; Information filters; Matched filters; Music; Nonlinear filters; Speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, Communications and Networking, 2007. ICSCN '07. International Conference on
Conference_Location :
Chennai
Print_ISBN :
1-4244-0997-7
Electronic_ISBN :
1-4244-0997-7
Type :
conf
DOI :
10.1109/ICSCN.2007.350670
Filename :
4156573
Link To Document :
بازگشت