• DocumentCode
    698052
  • Title

    Adaptation of a speech recognizer for singing voice

  • Author

    Mesaros, Annamaria ; Virtanen, Tuomas

  • Author_Institution
    Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    1779
  • Lastpage
    1783
  • Abstract
    This paper studies the speaker adaptation techniques that can be applied for adapting a speech recognizer to singing voice. Maximum likelihood linear regression (MLLR) techniques are studied, with specific details in choosing the number and types of transforms. The recognition performance of the different methods is measured in terms of phoneme recognition rate and singing-to-lyrics alignment errors of the adapted recognizers. Different methods improve the correct recognition rate with up to 10 percentage units, compared to the non-adapted system. In singing-to-lyrics alignment we obtain a best of 0.94 seconds mean absolute alignment error, compared to 1.26 seconds for the non-adapted system. Global adaptation was found to provide the most improvement in the performance, but small further improvement was obtained with regression tree adaptation.
  • Keywords
    maximum likelihood estimation; regression analysis; speech recognition; trees (mathematics); maximum likelihood linear regression; phoneme recognition rate; regression tree; singing voice; singing-to-lyrics alignment error; speaker adaptation technique; speech recognizer; Abstracts; Adaptation models; Arctic; Hidden Markov models; Speech; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077626