DocumentCode :
247168
Title :
An Efficient Time-Frequency Domain Speech Perceptual Hashing Authentication Algorithm Based on Discrete Wavelet Transform
Author :
Zhang Qiu-Yu ; Xing Peng-Fei ; Huang Yi-Bo ; Dong Rui-Hong ; Yang Zhong-ping
Author_Institution :
Sch. of Comput. & Commun., Lanzhou Univ. of Technol., Lanzhou, China
fYear :
2014
fDate :
8-10 Nov. 2014
Firstpage :
622
Lastpage :
627
Abstract :
According to the situation that speech authentication algorithms are not appropriated for real-time speech content authentication, a novel speech perceptual hashing authentication algorithm based on discrete wavelet transform with combination of time-frequency domain features was proposed. Firstly, by discrete wavelet transform (DWT), a new signal in frequency domain is generated from the original speech signal after pre-processing and intensity-loudness transform (ILT). Next, the algorithm partitions low frequency wavelet decomposition coefficients into equal-sized and non-overlapping blocks, and then computes logarithmic short-time energy of each block to obtain speech signal´s features in frequency domain. Finally, combining with spectral flux features (SFF) of speech signal in time domain, a ternary perceptual hashing sequence is created. Experiment results show that ternary form is better to stand for hashing digest than binary form, the proposed algorithm has a good robustness against content preserving operations, discrimination, compactness and high efficiency, and detects the tamper localization as well.
Keywords :
cryptography; discrete wavelet transforms; speech processing; time-frequency analysis; DWT; ILT; SFF; content preserving operations; discrete wavelet transform; intensity-loudness transform; logarithmic short-time energy; low frequency wavelet decomposition coefficients; real-time speech content authentication; spectral flux features; speech signal features; tamper localization detection; ternary perceptual hashing sequence; time-frequency domain speech perceptual hashing authentication algorithm; Authentication; Bit error rate; Discrete wavelet transforms; Feature extraction; Robustness; Speech; discrete wavelet transform; perceptual hashing; robustness; speech content authentication; time-frequency domain;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2014 Ninth International Conference on
Conference_Location :
Guangdong
Type :
conf
DOI :
10.1109/3PGCIC.2014.55
Filename :
7024657
Link To Document :
بازگشت