Title :
Towards robust word discovery by self-similarity matrix comparison
Author :
Muscariello, Armando ; Gravier, Guillaume ; Bimbot, Frédéric
Abstract :
Word discovery is the task of discovering and collecting occurrences of repeating words in the absence of prior acoustic and linguistic knowledge, or training material. The capability of extracting such patterns (or motifs) represents a preliminary step towards automatic mining of contentful information in spoken documents. The absence of modelling and training data, forces the use of direct pattern matching of speech templates, which, in turn, is sensitive to speech variability, like the inter-speaker one, for instance. In the present work, a variability tolerant pattern recognition technique is proposed that relies on the comparison of self similarity matrices of speech sequences. The joint use of such technique and a dynamic time warping dissimilarity measure, is shown to account for more variability with respect to the DTW-based system alone, as demonstrated on several hours of broadcast news shows.
Keywords :
data mining; natural language processing; pattern matching; speech recognition; word processing; automatic mining; direct pattern matching; dynamic time warping dissimilarity measure; linguistic knowledge; patterns extraction; robust word discovery; self similarity matrices; self similarity matrix comparison; speech sequences; speech templates; speech variability; spoken documents; training material; variability tolerant pattern recognition; words repeation; Acoustics; Histograms; Libraries; Pattern matching; Pixel; Speech; Speech recognition; dynamic programming; histogram of oriented gradients; unsupervised learning; word discovery;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947639