• DocumentCode
    3494789
  • Title

    An N-gram model for unstructured audio signals toward information retrieval

  • Author

    Kim, Samuel ; Sundaram, Shiva ; Georgiou, Panayiotis ; Narayanan, Shrikanth

  • Author_Institution
    Signal Anlaysis & Interpretation Lab. (SAIL), Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2010
  • fDate
    4-6 Oct. 2010
  • Firstpage
    477
  • Lastpage
    480
  • Abstract
    An N-gram modeling approach for unstructured audio signals is introduced with applications to audio information retrieval. The proposed N-gram approach aims to capture local dynamic information in acoustic words within the acoustic topic model framework which assumes an audio signal consists of latent acoustic topics and each topic can be interpreted as a distribution over acoustic words. Experimental results on classifying audio clips from BBC Sound Effects Library according to both semantic and onomatopoeic labels indicate that the proposed N-gram approach performs better than using only a bag-of-words approach by providing complementary local dynamic information.
  • Keywords
    acoustic signal processing; audio signal processing; classification; information retrieval; BBC sound effects library; N-gram model; acoustic topic model framework; acoustic word; audio clip classification; audio information retrieval; onomatopoeic label; unstructured audio signal; Acoustics; Conferences; Dictionaries; Feature extraction; Information retrieval; Semantics; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
  • Conference_Location
    Saint Malo
  • Print_ISBN
    978-1-4244-8110-1
  • Electronic_ISBN
    978-1-4244-8111-8
  • Type

    conf

  • DOI
    10.1109/MMSP.2010.5662068
  • Filename
    5662068