Title :
Privacy-preserving Query-by-Example Speech Search
Author :
Portelo, Jose ; Abad, Alberto ; Raj, Bhiksha ; Trancoso, Isabel
Author_Institution :
Inst. Super. Tecnico, Univ. de Lisboa, Lisbon, Portugal
Abstract :
This paper investigates a new privacy-preserving paradigm for the task of Query-by-Example Speech Search using Secure Binary Embeddings, a hashing method that converts vector data to bit strings through a combination of random projections followed by banded quantization. The proposed method allows performing spoken query search in an encrypted domain, by analyzing ciphered information computed from the original recordings. Unlike other hashing techniques, the embeddings allow the computation of the distance between vectors that are close enough, but are not perfect matches. This paper shows how these hashes can be combined with Dynamic Time Warping based on posterior derived features to perform secure speech search. Experiments performed on a sub-set of the Speech-Dat Portuguese corpus showed that the proposed privacy-preserving system obtains similar results to its non-private counterpart.
Keywords :
cryptography; data privacy; query formulation; speech processing; banded quantization; dynamic time warping; encrypted domain; hashing method; posterior derived features; privacy preserving speech search; query-by-example speech search; random projections; secure binary embeddings; spoken query search; vector data conversion; Acoustics; Euclidean distance; Europe; Feature extraction; Hamming distance; Quantization (signal); Speech; Data Privacy; Dynamic Time Warping; Query-by-Example Speech Search; Secure Binary Embeddings;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178280