Title :
An N-gram model for unstructured audio signals toward information retrieval
Author :
Kim, Samuel ; Sundaram, Shiva ; Georgiou, Panayiotis ; Narayanan, Shrikanth
Author_Institution :
Signal Anlaysis & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA
Abstract :
An N-gram modeling approach for unstructured audio signals is introduced with applications to audio information retrieval. The proposed N-gram approach aims to capture local dynamic information in acoustic words within the acoustic topic model framework which assumes an audio signal consists of latent acoustic topics and each topic can be interpreted as a distribution over acoustic words. Experimental results on classifying audio clips from BBC Sound Effects Library according to both semantic and onomatopoeic labels indicate that the proposed N-gram approach performs better than using only a bag-of-words approach by providing complementary local dynamic information.
Keywords :
acoustic signal processing; audio signal processing; classification; information retrieval; BBC sound effects library; N-gram model; acoustic topic model framework; acoustic word; audio clip classification; audio information retrieval; onomatopoeic label; unstructured audio signal; Acoustics; Conferences; Dictionaries; Feature extraction; Information retrieval; Semantics; Speech;
Conference_Titel :
Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
Conference_Location :
Saint Malo
Print_ISBN :
978-1-4244-8110-1
Electronic_ISBN :
978-1-4244-8111-8
DOI :
10.1109/MMSP.2010.5662068