DocumentCode :
14542
Title :
A Probabilistic Model-Based Approach for Aligning Multiple Audio Sequences
Author :
Basaran, Dogac ; Cemgil, Ali Taylan ; Anarim, Emin
Author_Institution :
Dept. of Electr. & Electron. Eng., Bogazici Univ., İstanbul, Turkey
Volume :
23
Issue :
7
fYear :
2015
fDate :
Jul-15
Firstpage :
1160
Lastpage :
1171
Abstract :
We formulate the alignment problem of multiple and partially overlapping audio sequences in a probabilistic framework. We define and compare five generative models for several time varying features extracted from audio clips that are recorded independently and asynchronously. For each model, we derive the associated scoring function that evaluates the quality of an alignment. The matching is then achieved with a sequential algorithm. The derived score functions are also able to identify the cases where the sequences do not overlap and handle multiple sequences where no sequence is covering the entire timeline. The simulation results on real data suggest that the approach is able to handle difficult, ambiguous scenarios and partial matchings where simple baseline methods such as correlation fail.
Keywords :
audio signal processing; feature extraction; probability; alignment problem; associated scoring function; audio clips; derived score functions; generative models; multiple overlapping audio sequences; partial matchings; partially overlapping audio sequences; probabilistic framework; time varying features; Computational modeling; Correlation; Hamming distance; IEEE transactions; Noise; Speech; Speech processing; Audio alignment; audio matching; audio synchronization; fingerprinting; multi sequence alignment; probabilistic model;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2419972
Filename :
7079478
Link To Document :
بازگشت