• DocumentCode
    698013
  • Title

    Singing voice detection in monophonic and polyphonic contexts

  • Author

    Lachambre, Helene ; Andre-Obrecht, Regine ; Pinquier, Julien

  • Author_Institution
    IRIT - Univ. de Toulouse, Narbonne, France
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    1344
  • Lastpage
    1348
  • Abstract
    In this article, we present an improvement of a previous singing voice detector. This new detector is in two steps. First, we distinguish monophonies from polyphonies. This distinction is based on the fact that the pitch estimated in a monophony is more reliable than the one estimated in a polyphony. We study the short term mean and variance of a confidence indicator; their repartition is modelled with bivariate Weibull distributions. We present a new method to estimate the parameters of these distributions with the moment method. Then, we detect the presence of singing voice. This is done by looking for the presence of vibrato, an oscillation of the fundamental frequency between 4 and 8 Hz. In a monophonic context, we look for vibrato on the pitch. In a polyphonic context, we first make a frequency tracking on the whole spectrogram, and then look for vibrato on each frequency tracks. Results are promising: from a global error rate of 29.7 % (previous method), we fall to a global error rate of 25 %. This means that taking into account the context (monophonic or polyphonic) leads to a relative gain of more than 16 %.
  • Keywords
    music; speech recognition; bivariate Weibull distribution; confidence indicator; moment method; monophonic context; polyphonic context; singing voice detection; spectrogram; vibrato; Computational modeling; Context; Detectors; Error analysis; Harmonic analysis; Instruments; Reliability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077587