DocumentCode
3225075
Title
Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals
Author
Fujihara, Hiromasa ; Goto, Masataka ; Ogata, Jun ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.
Author_Institution
Dept. of Intelligence Sci. & Technol., Kyoto Univ.
fYear
2006
fDate
Dec. 2006
Firstpage
257
Lastpage
264
Abstract
This paper describes a system that can automatically synchronize between polyphonic musical audio signals and corresponding lyrics. Although there were methods that can synchronize between monophonic speech signals and corresponding text transcriptions by using Viterbi alignment techniques, they cannot be applied to vocals in CD recordings because accompaniment sounds often overlap with vocals. To align lyrics with such vocals, we therefore developed three methods: a method for segregating vocals from polyphonic sound mixtures, a method for detecting vocal sections, and a method for adapting a speech-recognizer phone model to segregated vocal signals. Experimental results for 10 Japanese popular-music songs showed that our system can synchronize between music and lyrics with satisfactory accuracy for 8 songs
Keywords
Viterbi detection; audio discs; audio recording; audio signal processing; music; speech recognition; synchronisation; Japanese song; Viterbi alignment technique; automatic synchronization; lyrics; music CD recording; polyphonic musical audio signal; segregated vocal signal; speech-recognizer phone model; Adaptive signal detection; Auditory displays; Automatic speech recognition; CD recording; Hidden Markov models; Informatics; Multimedia systems; Multiple signal classification; Speech recognition; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia, 2006. ISM'06. Eighth IEEE International Symposium on
Conference_Location
San Diego, CA
Print_ISBN
0-7695-2746-9
Type
conf
DOI
10.1109/ISM.2006.38
Filename
4061176
Link To Document