DocumentCode :
2819324
Title :
Methods for singing voice identification using energy coefficients as features
Author :
Mesaros, Annamaria ; Moldovan, Simina
Author_Institution :
Dept. of Commun., Tech. Univ. of Cluj-Napoca
Volume :
2
fYear :
2006
fDate :
25-28 May 2006
Firstpage :
161
Lastpage :
166
Abstract :
This paper describes two energy representations of the voice signal and tests their efficiency in singing voice identification. The first set of energy features consists of the Mel-scale energies of 14 frequency bands, covering the whole frequency spectrum of the signal. The second energy representation is obtained by wavelet decomposition of the voice signal. The wavelet and scaling filters for the decomposition are derived from fractional B-spline functions. The wavelet decomposition is done hierarchically, into 14 bands, with octave-band filters, taking into account the specific frequencies of the formants. Both energy representations are tested for singing voice identification on the training set and on unknown data.
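As a rough illustration of the first feature set named in the abstract (Mel-scale energies of 14 bands spanning the whole spectrum), here is a minimal Python sketch. It is not the authors' implementation: the abstract does not state the mel formula, the filter shapes, or the framing, so the widely used `2595 * log10(1 + f / 700)` mapping, rectangular non-overlapping bands, and all function names below are assumptions chosen for illustration. A naive DFT is used only to keep the sketch dependency-free; in practice an FFT library would be used instead.

```python
import cmath
import math


def hz_to_mel(f_hz):
    # Common mel mapping (assumption: the paper's exact variant is not stated).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(sample_rate, n_bands=14):
    # n_bands + 1 edge frequencies in Hz, equally spaced on the mel scale
    # from 0 Hz up to the Nyquist frequency.
    top = hz_to_mel(sample_rate / 2.0)
    return [mel_to_hz(top * i / n_bands) for i in range(n_bands + 1)]


def power_spectrum(frame):
    # Naive O(N^2) DFT power spectrum for bins 0..N/2 (illustration only).
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def mel_band_energies(frame, sample_rate, n_bands=14):
    # Sum the spectral power falling into each rectangular mel band
    # (assumption: hard band edges rather than overlapping triangular filters).
    edges = mel_band_edges(sample_rate, n_bands)
    n = len(frame)
    energies = [0.0] * n_bands
    for k, p in enumerate(power_spectrum(frame)):
        freq = k * sample_rate / n
        band = n_bands - 1  # default: top band, which also catches the Nyquist bin
        for b in range(n_bands):
            if freq < edges[b + 1]:
                band = b
                break
        energies[band] += p
    return energies


# Usage: one 256-sample frame of a 1 kHz sine sampled at 16 kHz.
frame = [math.sin(2 * math.pi * 1000 * t / 16000) for t in range(256)]
features = mel_band_energies(frame, 16000)
```

Each frame yields a 14-element feature vector. A classifier for singer identification would typically consume such vectors after log compression or normalization, but the abstract does not specify those steps.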
Keywords :
filtering theory; music; signal representation; speaker recognition; splines (mathematics); Mel-scale energy; energy coefficients; energy representation; fractional B-spline functions; frequency spectrum; octave-band filtering; scaling filtering; singing voice identification; voice signal; wavelet decomposition; wavelet filtering; Fourier transforms; Frequency; Instruments; Power harmonic filters; Signal analysis; Signal processing; Speech analysis; Speech recognition; Testing; Timbre;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automation, Quality and Testing, Robotics, 2006 IEEE International Conference on
Conference_Location :
Cluj-Napoca
Print_ISBN :
1-4244-0360-X
Electronic_ISBN :
1-4244-0361-8
Type :
conf
DOI :
10.1109/AQTR.2006.254623
Filename :
4022946