• DocumentCode
    1749444
  • Title

    A predominant-F0 estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models

  • Author

    Goto, Masataka

  • Author_Institution
    PRESTO, Japan Sci. & Technol. Corp., Tsukuba, Japan
  • Volume
    5
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    3365
  • Abstract
    This paper describes a predominant-F0 (fundamental frequency) estimation method called PreFEst, which can detect melody and bass lines in monaural audio signals containing sounds of various instruments, While most previous methods premised mixtures of a few sounds and had difficulty dealing with such complex signals, our method can estimate the F0 of the melody and bass lines without assuming the number of sound sources in compact-disc recordings. In this paper we propose the following three extensions to our previous PreFEst to make it more adaptive and flexible: introducing multiple harmonic-structure tone models, estimating the shape of tone models, and introducing a prior distribution of its shape and F0 estimates These extensions were implemented by the MAP (maximum a posteriori probability) estimation by using the expectation-maximization algorithm. Experimental results with compact-disc recordings showed that our real-time system based on the extended PreFEst achieved performance improvement
  • Keywords
    audio signal processing; frequency estimation; iterative methods; maximum likelihood estimation; music; CD recordings; EM algorithm; MAP estimation; PreFEst; adaptive tone models; bass lines; compact-disc recordings; expectation-maximization algorithm; fundamental frequency estimation method; maximum a posteriori probability; melody; monaural audio signals; multiple harmonic-structure tone models; predominant-F0 estimation method; prior distribution; real-time system; tone models shape; Adaptive signal detection; Audio recording; CD recording; Disk recording; Frequency estimation; Humans; Instruments; Laboratories; Real time systems; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940380
  • Filename
    940380