DocumentCode :
476826
Title :
Insertion reduction in speech segmentation using neural network
Author :
Salam, M.-S. ; Mohamad, Dzulkifli ; Salleh, Sh-Hussain
Author_Institution :
Fac. of Comput. Sci. & Inf. Syst., Univ. Teknol. Malaysia, Skudai
Volume :
3
fYear :
2008
fDate :
26-28 Aug. 2008
Firstpage :
1
Lastpage :
7
Abstract :
Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a fusion between statistical and connectionist approach namely divergence algorithm and MLP neural network to improved segmentation by reducing insertions. The experiment conducted on Malay semi-spontaneous connected digit in classroom environment. The digit strings were manually segmented and trained using neural network with three set of data. The first training set trained without silence pattern, the second include silence while the last set introduced both silence and false pattern in the training. The experimental result on digit string segmentation shows number of insertion reduction of more than 5 times in comparison using divergence alone with increment of accuracy up to 40%.. The drawback however, the number of omission also increases to more than 10 times. Nevertheless, match segmentation rate still above 85%.
Keywords :
learning (artificial intelligence); multilayer perceptrons; natural language processing; speech processing; speech recognition; statistical analysis; string matching; MLP neural network; Malay semispontaneous connected digit; digit string segmentation; digit strings; divergence algorithm; insertion reduction; non-fixed overlapping window size; phonetic sequence; silence pattern; speech segmentation; speech signal; statistical approach; training set; Automatic speech recognition; Computer science; Electronic mail; Hidden Markov models; Humans; Neural networks; Signal processing; Speech analysis; Speech processing; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology, 2008. ITSim 2008. International Symposium on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-2327-9
Electronic_ISBN :
978-1-4244-2328-6
Type :
conf
DOI :
10.1109/ITSIM.2008.4632062
Filename :
4632062
Link To Document :
بازگشت