• DocumentCode
    1379238
  • Title

    Automatic Evaluation of Karaoke Singing Based on Pitch, Volume, and Rhythm Features

  • Author

    Tsai, Wei-Ho ; Lee, Hsin-Chieh

  • Author_Institution
    Dept. of Electron. Eng., Nat. Taipei Univ. of Technol., Taipei, Taiwan
  • Volume
    20
  • Issue
    4
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    1233
  • Lastpage
    1243
  • Abstract
    This study aims to develop an automatic singing evaluation system for Karaoke performances. Many Karaoke systems in the market today come with a scoring function. The addition of the feature enhances the entertainment appeal of the system due to the competitive nature of humans. The automatic Karaoke scoring mechanism to date, however, is still rudimentary, often giving inconsistent results with scoring by human raters. A cause of blunder arises from the fact that often only the singing volume is used as the evaluation criteria. To improve on the singing evaluation capabilities on Karaoke machines, this study exploits various acoustic features, including pitch, volume, and rhythm to assess a singing performance. We invited a number of singers having different levels of singing capabilities to record for Karaoke solo vocal samples. The performances were rated independently by four musicians, and then used in conjunction with additional Karaoke Video Compact Disk music for the training of our proposed system. Our experiment shows that the results of automatic singing evaluation are close to the human rating, where the Pearson product-moment correlation coefficient between them is 0.82.
  • Keywords
    audio signal processing; music; Pearson product-moment correlation coefficient; automatic singing evaluation system; karaoke scoring mechanism; karaoke singing; karaoke video compact disk music; pitch features; rhythm features; volume features; Accuracy; Humans; Lead; Rhythm; Timbre; Accompaniment; Karaoke; singing evaluation; solo vocal;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2174224
  • Filename
    6084727