• DocumentCode
    1413591
  • Title

    A Tandem Algorithm for Singing Pitch Extraction and Voice Separation From Music Accompaniment

  • Author

    Hsu, Chao-Ling ; Wang, DeLiang ; Jang, Jyh-Shing Roger ; Hu, Ke

  • Author_Institution
    Mediatek Inc., Hsinchu, Taiwan
  • Volume
    20
  • Issue
    5
  • fYear
    2012
  • fDate
    7/1/2012 12:00:00 AM
  • Firstpage
    1482
  • Lastpage
    1491
  • Abstract
    Singing pitch estimation and singing voice separation are challenging due to the presence of music accompaniments that are often nonstationary and harmonic. Inspired by computational auditory scene analysis (CASA), this paper investigates a tandem algorithm that estimates the singing pitch and separates the singing voice jointly and iteratively. Rough pitches are first estimated and then used to separate the target singer by considering harmonicity and temporal continuity. The separated singing voice and estimated pitches are used to improve each other iteratively. To enhance the performance of the tandem algorithm for dealing with musical recordings, we propose a trend estimation algorithm to detect the pitch ranges of a singing voice in each time frame. The detected trend substantially reduces the difficulty of singing pitch detection by removing a large number of wrong pitch candidates either produced by musical instruments or the overtones of the singing voice. Systematic evaluation shows that the tandem algorithm outperforms previous systems for pitch extraction and singing voice separation.
  • Keywords
    iterative methods; musical instruments; source separation; speech synthesis; CASA; computational auditory scene analysis; iterative method; music accompaniment; musical instruments; musical recordings; pitch range detection; singing pitch estimation; singing pitch extraction; singing voice separation; tandem algorithm; trend estimation algorithm; Estimation; Harmonic analysis; Hidden Markov models; Instruments; Spectrogram; Speech; Time frequency analysis; Computational auditory scene analysis (CASA); iterative procedure; pitch extraction; singing voice separation; tandem algorithm;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2182510
  • Filename
    6121941