Title :
Detecting offensive user video blogs: An adaptive keyword spotting approach
Author :
Barakat, M.S. ; Ritz, C.H. ; Stirling, D.A.
Author_Institution :
Sch. of Electr., Comput. & Telecommun. Eng., Univ. of Wollongong, Wollongong, NSW, Australia
Abstract :
This paper proposes a speaker independent keyword spotting (KWS) approach applied to the audio track of user video blogs that can help in their automatic analysis, indexing, search and retrieval. The approach, which relies on matching of keyword templates to speech segments using an adaptive similarity threshold that is estimated automatically for each utterance, does not require training data or language model as required in existing approaches such as those based on the Hidden Markov Model (HMM). This is a particular advantage for user video blogs since they usually contain words of interest that have not been adequately represented in a training database. Experiments conducted to detect offensive words in video blogs achieved much higher accuracy than existing speech-to-text based approaches.
Keywords :
audio signal processing; indexing; information retrieval; social networking (online); speaker recognition; speech synthesis; KWS approach; adaptive keyword spotting approach; adaptive similarity threshold; audio track; automatic analysis; indexing; keyword template matching; offensive user video blog detection; retrieval; search; speaker independent keyword spotting approach; speech segments; speech-to-text based approach; Blogs; Databases; Hidden Markov models; Histograms; Speech; Speech recognition; Vectors;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376654