DocumentCode
14542
Title
A Probabilistic Model-Based Approach for Aligning Multiple Audio Sequences
Author
Basaran, Dogac ; Cemgil, Ali Taylan ; Anarim, Emin
Author_Institution
Dept. of Electr. & Electron. Eng., Bogazici Univ., İstanbul, Turkey
Volume
23
Issue
7
fYear
2015
fDate
Jul-15
Firstpage
1160
Lastpage
1171
Abstract
We formulate the alignment problem of multiple and partially overlapping audio sequences in a probabilistic framework. We define and compare five generative models for several time varying features extracted from audio clips that are recorded independently and asynchronously. For each model, we derive the associated scoring function that evaluates the quality of an alignment. The matching is then achieved with a sequential algorithm. The derived score functions are also able to identify the cases where the sequences do not overlap and handle multiple sequences where no sequence is covering the entire timeline. The simulation results on real data suggest that the approach is able to handle difficult, ambiguous scenarios and partial matchings where simple baseline methods such as correlation fail.
Keywords
audio signal processing; feature extraction; probability; alignment problem; associated scoring function; audio clips; derived score functions; generative models; multiple overlapping audio sequences; partial matchings; partially overlapping audio sequences; probabilistic framework; time varying features; Computational modeling; Correlation; Hamming distance; IEEE transactions; Noise; Speech; Speech processing; Audio alignment; audio matching; audio synchronization; fingerprinting; multi sequence alignment; probabilistic model;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2015.2419972
Filename
7079478
Link To Document