Title :
A novel framework for recognizing phonemes of singing voice in polyphonic music
Author :
Fujihara, Hiromasa ; Goto, Masataka ; Okuno, Hiroshi G.
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
Abstract :
A novel method is described that can be used to recognize the phoneme of a singing voice (vocal) in polyphonic music. Though we focus on the voiced phoneme in this paper, this method is design to concurrently recognize other elements of a singing voice such as fundamental frequency and singer. Thus, this method is considered to be a new framework for recognizing a singing voice in polyphonic music. Our method stochastically models a mixture of a singing voice and other instrumental sounds without segregating the singing voice. It can also estimate a reliable spectral envelope by estimating it from many harmonic structures with various fundamental frequencies (F0s). The results of phoneme recognition experiments with 10 popular-music songs by 6 singers showed that our method improves the recognition accuracy by 8.7 points and achieves a 20.0% decrease in error rate.
Keywords :
music; speech recognition; fundamental frequency; instrumental sounds; phonemes recognition; polyphonic music; singing voice; Automatic speech recognition; Conferences; Degradation; Design methodology; Frequency estimation; Frequency synchronization; Instruments; Multiple signal classification; Music; Speech recognition; Mixture of experts; Phoneme recognition; Singing voice; Spectral modeling;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
DOI :
10.1109/ASPAA.2009.5346497