• DocumentCode
    381476
  • Title

    An HMM-based approach to humming transcription

  • Author

    Shih, Hsuan-Huei ; Narayanan, Shrikanth S. ; Kuo, C. C Jay

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    337
  • Abstract
    Providing natural and efficient access to the fast growing multimedia information, accommodating a variety of user skills and preferences, is a critical aspect of content-based information mining. Query by humming provides a natural means for content-based retrieval from music databases. A statistical pattern recognition approach for recognizing hummed or sung melodies is reported in this paper. Being data-driven, the proposed system aims at providing a robust front-end especially for dealing with variability in user´s productions. The segment of a note in the humming waveform is modeled by a hidden Markov model (HMM) while data features such as pitch measures are modeled by Gaussian mixture models (GMM). Preliminary real-time recognition experiments are carried out based on humming data obtained from eight users and an overall correct recognition rate of around 80% is demonstrated.
  • Keywords
    Gaussian distribution; audio databases; content-based retrieval; data mining; feature extraction; hidden Markov models; multimedia databases; music; statistical analysis; Gaussian mixture models; HMM; content-based information mining; content-based retrieval; hidden Markov model; humming transcription; multimedia information; music databases; note segmentation; pitch measures; query by humming; real-time recognition; robust front-end; statistical pattern recognition; Content based retrieval; Decoding; Frequency; Hidden Markov models; Humans; Multimedia databases; Music information retrieval; Pattern recognition; Robustness; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7803-7304-9
  • Type

    conf

  • DOI
    10.1109/ICME.2002.1035787
  • Filename
    1035787