Title :
A Probabilistic Model-Based Approach for Aligning Multiple Audio Sequences
Author :
Basaran, Dogac ; Cemgil, Ali Taylan ; Anarim, Emin
Author_Institution :
Dept. of Electr. & Electron. Eng., Bogazici Univ., İstanbul, Turkey
Abstract :
We formulate the alignment problem of multiple and partially overlapping audio sequences in a probabilistic framework. We define and compare five generative models for several time varying features extracted from audio clips that are recorded independently and asynchronously. For each model, we derive the associated scoring function that evaluates the quality of an alignment. The matching is then achieved with a sequential algorithm. The derived score functions are also able to identify the cases where the sequences do not overlap and handle multiple sequences where no sequence is covering the entire timeline. The simulation results on real data suggest that the approach is able to handle difficult, ambiguous scenarios and partial matchings where simple baseline methods such as correlation fail.
Keywords :
audio signal processing; feature extraction; probability; alignment problem; associated scoring function; audio clips; derived score functions; generative models; multiple overlapping audio sequences; partial matchings; partially overlapping audio sequences; probabilistic framework; time varying features; Computational modeling; Correlation; Hamming distance; IEEE transactions; Noise; Speech; Speech processing; Audio alignment; audio matching; audio synchronization; fingerprinting; multi sequence alignment; probabilistic model;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2015.2419972