DocumentCode :
3641667
Title :
Model based audio sequence alignment
Author :
Doğaç Başaran;Emin Anarım;Ali Taylan Cemgil
Author_Institution :
Elektrik ve Elektronik Mü
fYear :
2011
fDate :
4/1/2011
Firstpage :
606
Lastpage :
609
Abstract :
We formulate the alignment of multiple audio sequences in a probabilistic framework. Our approach defines a generative model for time-varying features extracted from audio clips that are recorded independently and asynchronously. The model handles missing data and multiple clips, even when no single clip covers the entire material. Matching is achieved via approximate Bayesian inference. Here, we illustrate a simulated tempering approach for sampling from the exact posterior density of the clip offsets. Simulation results on synthetic and real data suggest that the framework can handle difficult, ambiguous scenarios or partial matchings.
Keywords :
"Markov processes","Conferences","Bayesian methods","Speech processing","Feature extraction","Speech"
Publisher :
IEEE
Conference_Titel :
Signal Processing and Communications Applications (SIU), 2011 IEEE 19th Conference on
ISSN :
2165-0608
Print_ISBN :
978-1-4577-0462-8
Type :
conf
DOI :
10.1109/SIU.2011.5929723
Filename :
5929723