DocumentCode :
2506578
Title :
Rethinking Algorithm Design and Development in Speech Processing
Author :
Stadelmann, Thilo ; Wang, Yinghui ; Smith ; Ewerth, Ralph ; Freisleben, Bernd
Author_Institution :
Dept. of Math, & Comput. Sci., Univ. of Marburg, Marburg, Germany
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
4476
Lastpage :
4479
Abstract :
Speech processing is typically based on a set of complex algorithms requiring many parameters to be specified. When parts of the speech processing chain do not behave as expected, trial and error is often the only way to investigate the reasons. In this paper, we present a research methodology to analyze unexpected algorithmic behavior by making (intermediate) results of the speech processing chain perceivable and intuitively comprehensible by humans. The workflow of the process is explicated using a real-world example leading to considerable improvements in speaker clustering. The described methodology is supported by a software toolbox available for download.
Keywords :
speaker recognition; speech processing; algorithmic behavior; complex algorithms; process workflow; research methodology; rethinking algorithm design; software toolbox; speaker clustering; speech processing chain; Algorithm design and analysis; Context; Data visualization; Humans; Speech; Speech processing; Visualization; eidetic design; intuition; resynthesis; teaching; visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.1087
Filename :
5597381
Link To Document :
بازگشت