DocumentCode :
2582881
Title :
Content based audio retrieval with MFCC feature extraction, clustering and sort-merge techniques
Author :
Nagavi, Trisiladevi C. ; Anusha, S.B. ; Monisha, P. ; Poornima, S.P.
Author_Institution :
Dept. of Comput. Sci. & Eng., S.J. Coll. of Eng., Mysore, India
fYear :
2013
fDate :
4-6 July 2013
Firstpage :
1
Lastpage :
6
Abstract :
Content Based Audio Retrieval (CBAR) has been a growing field of research for the past decade. To be capable of classifying and accessing the audio files, relevant to user´s concern, is fundamental for structuring multimedia web search engines. This paper proposes a technique to build a system to retrieve audio files by acoustic similarity using Sort-Merge technique. The frequency features of the audio streams are extracted. We consider Mel Frequency Cepstral Coefficients (MFCC) for dimensionality reduction. The mean of the coefficients for the key song and for all the songs in the database is taken. Then, difference measure is calculated using Euclidian Distance. This retrieval is tested on a corpus of songs sung by both professional and non-professional singers. When a query audio is given, the system first finds the clusters with identical high energy components, merges them and then the audio files in all the merged clusters are sorted according to their distances. With this approach, we can classify and retrieve standard audios more precisely, using fewer features and less computation time.
Keywords :
audio signal processing; content-based retrieval; feature extraction; merging; pattern clustering; signal classification; sorting; CBAR; Euclidian distance; MFCC feature extraction; Mel frequency cepstral coefficients; acoustic similarity; audio file access; audio file classification; clustering technique; content based audio retrieval; difference measure; dimensionality reduction; frequency feature extraction; multimedia Web search engines; sort-merge technique; Accuracy; Arrays; Databases; Feature extraction; Mel frequency cepstral coefficient; Multimedia communication; Clustering; Euclidian Distance; Mel Frequency Cepstral Co-efficient (MFCC); Sort- Merge;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Communications and Networking Technologies (ICCCNT),2013 Fourth International Conference on
Conference_Location :
Tiruchengode
Print_ISBN :
978-1-4799-3925-1
Type :
conf
DOI :
10.1109/ICCCNT.2013.6850234
Filename :
6850234
Link To Document :
بازگشت