• DocumentCode
    699869
  • Title

    Evaluation of audio features for audio-visual analysis of dance figures

  • Author

    Demir, Y. ; Erzin, E. ; Yemez, Y. ; Tekalp, A.M.

  • Author_Institution
    Koc Univ., Istanbul, Turkey
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    We present a framework for selecting best audio features for audio-visual analysis and synthesis of dance figures. Dance figures are performed synchronously with the musical rhythm. They can be analyzed through the audio spectra using spectral and rhythmic musical features. In the proposed audio feature evaluation system, dance figures are manually labeled over the video stream. The music segments, which correspond to labeled dance figures, are used to train hidden Markov model (HMM) structures to learn spectral audio patterns for the dance figure melodies. The melody recognition performances of the HMM models for various spectral feature sets are evaluated. Audio features, which are maximizing dance figure melody recognition performances, are selected as the best audio features for the analyzed audio-visual dance recordings. In our evaluations, mel-scale cepstral coefficients (MFCC) with their first and second derivatives, spectral centroid, spectral flux and spectral roll-off are used as candidate audio features. Selection of the best audio features can be used towards analysis and synthesis of audio-driven body animation.
  • Keywords
    audio signal processing; audio-visual systems; cepstral analysis; feature extraction; hidden Markov models; music; HMM structures; audio feature evaluation system; audio spectra; audio-driven body animation; audio-visual analysis; dance figure melody recognition; dance figure synthesis; hidden Markov model; mel-scale cepstral coefficients; musical rhythm; rhythmic musical features; spectral audio patterns; spectral centroid; spectral flux; spectral musical features; spectral roll-off; video stream; Abstracts; Frequency modulation; Hidden Markov models; Magnetic resonance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080401