Title :
Temporal Dynamics for Spectral Sub-Band Centroid Audio Fingerprints
Author :
Jin, Minho ; Yoo, Chang D.
Author_Institution :
Korea Adv. Inst. of Sci. & Technol., Daejeon
Abstract :
Motivated by the effectual use of temporal information in speech recognition, we investigate the effectiveness of the temporal dynamics of the spectral sub-band centroid (SSC) fingerprints for audio fingerprinting. The SSC, which is known to be a robust audio fingerprint against various distortions, does not involve any temporal dynamics. Here, the temporal dynamics are defined as the difference between two neighboring SSCs. The robustness of the temporal dynamics against various distortions were compared to that of SSCs. The system using temporal dynamics showed similar performance to that using SSCs in various distortions except for time-scale modification and linear-speed change. This is to be expected since these distortions change the time correlation of an audio. In our experiment, the concatenation of SSCs and the temporal dynamics outperformed each of the individual fingerprints. This suggests that the SSCs and the temporal dynamics provide information which is mutually supplementary.
Keywords :
audio signal processing; speech processing; speech recognition; audio fingerprints; spectral subband centroid fingerprints; speech recognition; temporal dynamics; Copyright protection; Fingerprint recognition; Hidden Markov models; Information technology; Multimedia databases; Multimedia systems; Robustness; Software development management; Speech recognition; Technology management;
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
DOI :
10.1109/ICME.2007.4284616