• DocumentCode
    2931412
  • Title

    Audio contributions to semantic video search

  • Author

    Trancoso ; Pellegrini, T. ; Elo, J. Port ; Meinedo, H. ; Bugalho, M. ; Abad, A. ; Neto, J.

  • Author_Institution
    INESC-ID Lisboa, Portugal
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    630
  • Lastpage
    633
  • Abstract
    This paper summarizes the contributions to semantic video search that can be derived from the audio signal. Because of space restrictions, the emphasis will be on non-linguistic cues. The paper thus covers what is generally known as audio segmentation, as well as audio event detection. Using machine learning approaches, we have built detectors for over 50 semantic audio concepts.
  • Keywords
    audio signal processing; learning (artificial intelligence); search engines; video signal processing; audio contributions; audio event detection; audio segmentation; audio signal; machine learning approaches; nonlinguistic cues; semantic video search; space restrictions; Acoustic signal detection; Detectors; Event detection; Feature extraction; Hidden Markov models; Linear discriminant analysis; Loudspeakers; Machine learning; Principal component analysis; Speech recognition; Audio Event Detection; Audio Segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202575
  • Filename
    5202575