Title :
Identification of Voice Segments in Stereo Soundtracks
Author :
Padhi, Kabi Prakash ; Mahapatra, Rabi N.
Author_Institution :
Texas A&M Univ., Galveston, TX
Abstract :
While the human auditory system can easily distinguish between the voice and the audio components of a soundtrack, this appears to be a significant challenge for any computer model because the audio and speech spectra overlap. The algorithm developed is able to discern the presence of voice in a stereo input with a high degree of accuracy. It is based on a time-domain energy criterion, which requires less computation than frequency-domain techniques.
Keywords :
audio signal processing; speech processing; time-domain analysis; human auditory system; stereo soundtrack; time-domain energy criterion; voice segment identification; Auditory system; Delay; Frequency domain analysis; Histograms; Humans; Instruments; Parameter estimation; Speech; Time domain analysis; Time frequency analysis
Conference_Title :
Information, Communications and Signal Processing, 2005 Fifth International Conference on
Conference_Location :
Bangkok
Print_ISBN :
0-7803-9283-3
DOI :
10.1109/ICICS.2005.1689322