An Efficient Time-Frequency Domain Speech Perceptual Hashing Authentication Algorithm Based on Discrete Wavelet Transform

Author

Zhang Qiu-Yu ; Xing Peng-Fei ; Huang Yi-Bo ; Dong Rui-Hong ; Yang Zhong-ping

Author_Institution

Sch. of Comput. & Commun., Lanzhou Univ. of Technol., Lanzhou, China

fYear

2014

fDate

8-10 Nov. 2014

Firstpage

622

Lastpage

627

Abstract

Because existing speech authentication algorithms are not well suited to real-time speech content authentication, a novel speech perceptual hashing authentication algorithm is proposed that is based on the discrete wavelet transform (DWT) and combines time-domain and frequency-domain features. First, after pre-processing and an intensity-loudness transform (ILT), the DWT is applied to the original speech signal to generate a new signal in the frequency domain. Next, the algorithm partitions the low-frequency wavelet decomposition coefficients into equal-sized, non-overlapping blocks and computes the logarithmic short-time energy of each block to obtain the speech signal's frequency-domain features. Finally, these features are combined with the spectral flux features (SFF) of the speech signal in the time domain to create a ternary perceptual hashing sequence. Experimental results show that a ternary form represents the hashing digest better than a binary form, and that the proposed algorithm offers good robustness against content-preserving operations, good discrimination and compactness, and high efficiency, and can also localize tampering.
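
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the wavelet, the decomposition depth, the block count, or the rule that merges the two feature streams into ternary symbols, so the Haar wavelet, `levels=3`, `n_blocks=64`, and the sign-based combination rule below are all assumptions chosen for illustration. Pre-processing and the ILT step are omitted, and every function name is hypothetical.

```python
import numpy as np


def haar_approx(x, levels=3):
    """Approximation (low-frequency) coefficients of a multi-level Haar DWT.

    Assumption: the Haar wavelet stands in for whatever wavelet the paper uses.
    """
    a = np.asarray(x, dtype=float)
    for _ in range(levels):
        if len(a) % 2:  # pad odd-length input by repeating the last sample
            a = np.append(a, a[-1])
        a = (a[0::2] + a[1::2]) / np.sqrt(2.0)
    return a


def block_log_energy(coeffs, n_blocks):
    """Logarithmic short-time energy of equal-sized, non-overlapping blocks."""
    block_len = len(coeffs) // n_blocks
    blocks = coeffs[: block_len * n_blocks].reshape(n_blocks, block_len)
    # Small constant avoids log(0) on silent blocks.
    return np.log10(np.sum(blocks ** 2, axis=1) + 1e-12)


def spectral_flux(x, n_blocks):
    """Spectral flux per frame: squared change in magnitude spectrum
    between consecutive non-overlapping frames (first frame gets 0)."""
    x = np.asarray(x, dtype=float)
    frame_len = len(x) // n_blocks
    frames = x[: frame_len * n_blocks].reshape(n_blocks, frame_len)
    mag = np.abs(np.fft.rfft(frames, axis=1))
    diff = np.diff(mag, axis=0, prepend=mag[:1])
    return np.sum(diff ** 2, axis=1)


def ternary_hash(x, n_blocks=64, levels=3):
    """Ternary hash of length n_blocks - 1.

    Hypothetical combination rule (not taken from the paper):
    2 if both features rise, or one rises while the other is unchanged;
    0 if both fall, or one falls while the other is unchanged;
    1 otherwise.
    """
    energy = block_log_energy(haar_approx(x, levels), n_blocks)
    flux = spectral_flux(x, n_blocks)
    trend = np.sign(np.diff(energy)) + np.sign(np.diff(flux))
    return np.where(trend > 0, 2, np.where(trend < 0, 0, 1)).astype(np.uint8)


def hash_distance(h1, h2):
    """Fraction of positions at which two equal-length hashes differ."""
    return float(np.mean(np.asarray(h1) != np.asarray(h2)))
```

In such a sketch, authentication would compare `hash_distance(ternary_hash(received), ternary_hash(original))` against a threshold, and the positions where the two hashes differ would indicate which blocks may have been tampered with. The threshold itself would have to be chosen empirically.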

Keywords

cryptography; discrete wavelet transforms; speech processing; time-frequency analysis; DWT; ILT; SFF; content preserving operations; discrete wavelet transform; intensity-loudness transform; logarithmic short-time energy; low frequency wavelet decomposition coefficients; real-time speech content authentication; spectral flux features; speech signal features; tamper localization detection; ternary perceptual hashing sequence; time-frequency domain speech perceptual hashing authentication algorithm; Authentication; Bit error rate; Discrete wavelet transforms; Feature extraction; Robustness; Speech; discrete wavelet transform; perceptual hashing; robustness; speech content authentication; time-frequency domain;

fLanguage

English

Publisher

IEEE

Conference_Title

P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2014 Ninth International Conference on

Conference_Location

Guangdong

Type

conf

DOI

10.1109/3PGCIC.2014.55

Filename

7024657