• DocumentCode
    14542
  • Title
    A Probabilistic Model-Based Approach for Aligning Multiple Audio Sequences
  • Author
    Basaran, Dogac; Cemgil, Ali Taylan; Anarim, Emin
  • Author_Institution
    Dept. of Electr. & Electron. Eng., Bogazici Univ., İstanbul, Turkey
  • Volume
    23
  • Issue
    7
  • fYear
    2015
  • fDate
    Jul-15
  • Firstpage
    1160
  • Lastpage
    1171
  • Abstract
    We formulate the alignment problem of multiple and partially overlapping audio sequences in a probabilistic framework. We define and compare five generative models for several time-varying features extracted from audio clips that are recorded independently and asynchronously. For each model, we derive the associated scoring function that evaluates the quality of an alignment. The matching is then achieved with a sequential algorithm. The derived score functions are also able to identify the cases where the sequences do not overlap and to handle multiple sequences where no sequence covers the entire timeline. The simulation results on real data suggest that the approach is able to handle difficult, ambiguous scenarios and partial matchings where simple baseline methods such as correlation fail.
  • Keywords
    audio signal processing; feature extraction; probability; alignment problem; associated scoring function; audio clips; derived score functions; generative models; multiple overlapping audio sequences; partial matchings; partially overlapping audio sequences; probabilistic framework; time varying features; Computational modeling; Correlation; Hamming distance; IEEE transactions; Noise; Speech; Speech processing; Audio alignment; audio matching; audio synchronization; fingerprinting; multi sequence alignment; probabilistic model;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type
    jour
  • DOI
    10.1109/TASLP.2015.2419972
  • Filename
    7079478