Title :
The power of words: Enhancing music mood estimation with textual input of lyrics
Author :
Chi, Chung Yi ; Wu, Ying Shian ; Chu, Wei Rong ; Wu, Daniel C., Jr. ; Hsu, Jane Yung jen ; Tsai, Richard Tzong Han
Author_Institution :
Dept. of Comput. Sci. & Informantion Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Music mood estimation (MME) is a key technology in mood-based music recommendation. While mainstream MME research nowadays relies on audio music analysis, exploring the significance of lyrics text in predicting song emotion is gaining attention in recent years. One major impediment to MME research is the lack of a clearly labeled and publicly available dataset annotating the emotion ratings of lyrics text and audio separately. In light of this, we compiled a dataset of 600 pop songs (iPop) from the mood ratings of 246 participants who experienced three different song sessions, lyrics text (L), audio music track (M), and the combination of lyrics text and audio music track (C). We then applied statistical analysis to estimate how lyrics text and audio contribute to a song´s overall valence-arousal (V-A) mood ratings. Our results show that lyrics text are not only a valid measure for estimating a song´s mood ratings but also provide supplementary information that can improve audio-only MME systems. Furthermore, a detailed examination suggests that lyrics text (L) ratings are better estimators of the overall mood ratings of a song (C) in cases where L and M ratings conflict. We then construct a MME system that employs both features extracted from lyrics text and audio music track and validate the conclusions acquired in our statistical analysis. In estimating either V or A rating, the model with lyrics text plus audio track features performs better than only the model with only lyrics text or audio track features. These results validate the statement acquired by the statistical analysis.
Keywords :
behavioural sciences computing; information retrieval; music; statistical analysis; audio emotion ratings; audio music analysis; audio music track; lyrics text emotion ratings; lyrics text ratings; lyrics textual input; mood-based music recommendation; music mood estimation; song emotion prediction; statistical analysis; valence-arousal mood ratings; Computer science; Data mining; Feature extraction; Impedance; Mel frequency cepstral coefficient; Mood; Music information retrieval; Power engineering and energy; Recommender systems; Statistical analysis;
Conference_Titel :
Affective Computing and Intelligent Interaction and Workshops, 2009. ACII 2009. 3rd International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4244-4800-5
Electronic_ISBN :
978-1-4244-4799-2
DOI :
10.1109/ACII.2009.5349591