DocumentCode :
3484315
Title :
Single-channel particular voice activity detection for monitoring the violence situations
Author :
Hyun-Don Kim ; Sung-Su Ahn ; Kyoung-Ho Kim ; Jong-suk Choi
Author_Institution :
Daegu Mechatron. & Mater. Inst., Daegu, South Korea
fYear :
2013
fDate :
26-29 Aug. 2013
Firstpage :
412
Lastpage :
417
Abstract :
The proposed algorithm in this paper is capable of classifying not only unusual speech when people get anger, surprised, or excited but also unusual noise such as clashing, hitting, or clapping in real-time without depending on particular speaker voices or utterances. Also, it does not require a prior learning process to construct acoustic models. This algorithm, therefore, allows a surveillance camera system to effectively monitor quarrel or violent situations regardless of object shields and light conditions. To realize our approach, we analyze the variance and change of spectral densities and pitches when unusual speech and noise occur. We then propose new methods (SEBNI, USDF, and UNDF) to classify unusual sounds in real-time. Moreover, to improve performance, we apply a noise suppression system based on MMSE-STSA and a statistic model-based VAD to our algorithm in order to extract reliable voice features and segment only voice-related periods in noisy environments. We confirm that our proposed method achieves an 87% accuracy performance for classifying unusual speech.
Keywords :
behavioural sciences; computerised monitoring; feature extraction; real-time systems; signal denoising; spectral analysis; speech recognition; statistical analysis; MMSE-STSA; SEBNI method; UNDF method; USDF method; noise suppression system; noisy environments; performance improvement; pitch change; quarrel situation monitoring; real-time classification; reliable voice feature extraction; single-channel particular voice activity detection; spectral density change; spectral density variance; statistic model-based VAD; surveillance camera system; unusual noise classification; unusual sound classification; unusual speech classification; violent situation monitoring; voice-related periods; Equations; Feature extraction; Indexes; Mathematical model; Noise; Noise measurement; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
RO-MAN, 2013 IEEE
Conference_Location :
Gyeongju
ISSN :
1944-9445
Type :
conf
DOI :
10.1109/ROMAN.2013.6628514
Filename :
6628514
Link To Document :
بازگشت