DocumentCode
3494789
Title
An N-gram model for unstructured audio signals toward information retrieval
Author
Kim, Samuel ; Sundaram, Shiva ; Georgiou, Panayiotis ; Narayanan, Shrikanth
Author_Institution
Signal Anlaysis & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA
fYear
2010
fDate
4-6 Oct. 2010
Firstpage
477
Lastpage
480
Abstract
An N-gram modeling approach for unstructured audio signals is introduced with applications to audio information retrieval. The proposed N-gram approach aims to capture local dynamic information in acoustic words within the acoustic topic model framework which assumes an audio signal consists of latent acoustic topics and each topic can be interpreted as a distribution over acoustic words. Experimental results on classifying audio clips from BBC Sound Effects Library according to both semantic and onomatopoeic labels indicate that the proposed N-gram approach performs better than using only a bag-of-words approach by providing complementary local dynamic information.
Keywords
acoustic signal processing; audio signal processing; classification; information retrieval; BBC sound effects library; N-gram model; acoustic topic model framework; acoustic word; audio clip classification; audio information retrieval; onomatopoeic label; unstructured audio signal; Acoustics; Conferences; Dictionaries; Feature extraction; Information retrieval; Semantics; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
Conference_Location
Saint Malo
Print_ISBN
978-1-4244-8110-1
Electronic_ISBN
978-1-4244-8111-8
Type
conf
DOI
10.1109/MMSP.2010.5662068
Filename
5662068
Link To Document