• DocumentCode
    3196581
  • Title

    A Vector-Based Approach to Broadcast Audio Database Indexing and Retrieval

  • Author

    Wang, Lei ; Li, Haizhou ; Chng, Eng Siong

  • Author_Institution
    Nanyang Technol. Univ., Singapore
  • fYear
    2007
  • fDate
    2-5 July 2007
  • Firstpage
    512
  • Lastpage
    515
  • Abstract
    This paper proposes a novel framework to index and retrieve audio content from broadcast database that contains both speech and music. In this framework, we model the acoustic events using hidden Markov models, which are then used to decode the audio content. The decoding results in the form of acoustic token sequence and acoustic lattice are used to generate features for indexing and retrieval with the vector space model. Experiments were carried out on the TRECVID database and the results showed that the proposed framework is effective in audio information retrieval. The results also showed that the features generated from the acoustic lattice provide more accurate information than token sequence.
  • Keywords
    acoustic signal processing; audio coding; audio databases; database indexing; hidden Markov models; information retrieval; music; speech processing; acoustic events; acoustic lattice; acoustic token sequence; audio database indexing; audio database retrieval; audio decoding; broadcast database; hidden Markov model; music; speech; vector space model; Audio databases; Broadcasting; Decoding; Hidden Markov models; Indexes; Indexing; Information retrieval; Lattices; Music information retrieval; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2007 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-1016-9
  • Electronic_ISBN
    1-4244-1017-7
  • Type

    conf

  • DOI
    10.1109/ICME.2007.4284699
  • Filename
    4284699