• DocumentCode
    3461131
  • Title

    Linear hidden Markov model for music information retrieval based on humming

  • Author

    Liu, Baolong ; Wu, Yadong ; Li, Yang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
  • Volume
    5
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Recently, some studies have placed emphasis on statistical analysis in music information retrieval (MIR). The paper is concerned with applying a linear hidden Markov model (HMM) with three kinds of states, S, C and D, as the matching mechanism for a query by a humming system. Note segmentation, pitch tracking and the database of the system are briefly introduced. The paper analyzes six probable errors in humming and proposes the SCD HMM to model each song. Each of the states, S, C and D, represents two of the six errors. The SCD HMM describes all kinds of possibilities of errors in a hummed query. Each query can find a most probable state sequence in a SCD HMM and get a probability score that determines the similarity between the query and the candidate songs. The retrieval system contains about 1000 Chinese folk songs. Experimental results show that the model is robust to the six errors and generally a 90% matching accuracy (listed on top 5) can be achieved.
  • Keywords
    audio databases; audio signal processing; hidden Markov models; music; query processing; Chinese folk songs; humming; linear hidden Markov model; music database; music information retrieval; note segmentation; pitch tracking; state sequence; statistical analysis; Computer science; Content based retrieval; Databases; Dynamic programming; Error analysis; Hidden Markov models; Multiple signal classification; Music information retrieval; Robustness; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1200024
  • Filename
    1200024