• DocumentCode
    2291645
  • Title
    Predominant pitch contour extraction from audio signals
  • Author
    Malik, Hafiz ; Khokhar, Ashfaq ; Ansari, Rashid ; De Baillon, Bruno Cappe
  • Author_Institution
    Multimedia Syst. Lab, Chicago, IL, USA
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    257
  • Abstract
    This paper describes a computationally efficient method for estimating the predominant pitch in audio recordings. The proposed method is intended for building a content-based indexing and retrieval system that can search an audio database using the melody line of a complex input audio sample. Available pitch estimation methods are effective primarily on recordings of the human voice that is either unaccompanied or accompanied by one or two musical instruments. These methods perform poorly on complex music signals because they rely on directly estimating the fundamental frequency (F0), a task hampered by instrumental sounds, such as those of guitar and piano, that overlap the voice in frequency. In our method we exploit the higher harmonic structure of the human voice to develop a low-complexity system for estimating predominant pitch. Experimental results show that this computationally efficient method provides a robust estimate of predominant pitch in real-world audio signals with an 85% success rate.
  • Keywords
    audio databases; audio recording; audio signal processing; feature extraction; frequency estimation; music; query processing; audio database; audio recordings; computationally efficient method; content-based indexing system; content-based retrieval system; fundamental frequency estimation; guitar; harmonic structure; human voice; input audio sample; instrumental sounds; low-complexity system; melody line; music signals; musical instruments; piano; predominant pitch contour extraction; predominant pitch estimation; real-world audio signals; Audio databases; Audio recording; Content based retrieval; Frequency estimation; Human voice; Indexing; Instruments; Multiple signal classification; Music information retrieval; Robustness;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7803-7304-9
  • Type
    conf
  • DOI
    10.1109/ICME.2002.1035568
  • Filename
    1035568
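
The abstract's idea of estimating predominant pitch from higher harmonics, rather than from F0 directly, can be illustrated with a generic harmonic-summation sketch. This is not the authors' algorithm, which the record does not specify. The function names, the 150 to 400 Hz search range, the choice of harmonics 2 through 8, and the synthetic test signal are all assumptions made for illustration.

```python
import numpy as np


def estimate_pitch_from_harmonics(frame, sample_rate, f_min=150.0, f_max=400.0,
                                  harmonics=range(2, 9), step_hz=1.0):
    """Pick the candidate F0 (Hz) whose higher harmonics carry the most energy.

    Illustrative harmonic summation only; all parameters are assumptions,
    not values taken from the paper.
    """
    # Magnitude spectrum of the Hann-windowed frame.
    spectrum = np.abs(np.fft.rfft(frame * np.hanning(len(frame))))
    bin_hz = sample_rate / len(frame)

    candidates = np.arange(f_min, f_max + step_hz, step_hz)
    scores = np.zeros(len(candidates))
    for i, f0 in enumerate(candidates):
        for k in harmonics:
            # Nearest FFT bin to the k-th harmonic; the fundamental (k = 1)
            # is deliberately skipped.
            b = int(round(k * f0 / bin_hz))
            if b < len(spectrum):
                scores[i] += spectrum[b]
    return float(candidates[int(np.argmax(scores))])


def synth_test_frame(f0, sample_rate, n_samples, interferer_hz=196.0):
    """Hypothetical test signal: harmonics 2..10 of f0 with no fundamental,
    plus a stronger pure tone in the F0 region standing in for accompaniment."""
    t = np.arange(n_samples) / sample_rate
    voice = sum(np.sin(2 * np.pi * k * f0 * t) for k in range(2, 11))
    accompaniment = 2.0 * np.sin(2 * np.pi * interferer_hz * t)
    return voice + accompaniment


if __name__ == "__main__":
    frame = synth_test_frame(220.0, 16000, 4096)
    print(estimate_pitch_from_harmonics(frame, 16000))
```

Plain harmonic summation is prone to octave and subharmonic errors; the narrow candidate range above sidesteps them for this toy signal, and a practical system would need a more careful scoring rule.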