DocumentCode :
697764
Title :
Pattern extraction in sparse representations with application to audio coding
Author :
Pichevar, Ramin ; Najaf-Zadeh, Hossein
Author_Institution :
Commun. Res. Centre, Ottawa, ON, Canada
fYear :
2009
fDate :
24-28 Aug. 2009
Firstpage :
1249
Lastpage :
1253
Abstract :
This article deals with the extraction of frequency-domain auditory objects in sparse representations. To do so, we first generate sparse audio representations we call spikegrams, based on neural spikes using gammatone/gammachirp kernels and matching pursuit. We then propose a method to extract frequent auditory objects (patterns) in the afore-mentioned sparse representations. The extracted frequency-domain patterns help us address spikes (atoms or auditory events) collectively rather than individually. When audio compression is needed, the different patterns are stored in a small codebook that can be used to efficiently encode audio materials in a lossless way. The approach is applied to different audio signals and results are discussed and compared. Our experiments show that substantial coding gain is obtained when our technique based on pattern extraction is used as opposed to the case where spikes (atoms) are coded individually. This work is a first step towards the design of a high-quality “object-based” audio coder.
Keywords :
audio coding; feature extraction; iterative methods; audio coding; audio compression; frequency-domain auditory object extraction; gammachirp kernel; gammatone kernel; high quality object based audio coder; matching pursuit; neural spikes; pattern extraction; sparse audio representations; sparse representation; spikegram; Abstracts; Biological information theory; Bit rate; Encoding; Frequency-domain analysis; Organizations; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2009 17th European
Conference_Location :
Glasgow
Print_ISBN :
978-161-7388-76-7
Type :
conf
Filename :
7077336
Link To Document :
بازگشت