• DocumentCode
    542222
  • Title
    Spline-based continuous-time pitch estimation
  • Author
    Jefremov, Andrei ; Kleijn, W. Bastiaan
  • Author_Institution
    Department of Speech, Music and Hearing, KTH (Royal Institute of Technology), 10044 Stockholm, Sweden
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Pitch-synchronous speech coding algorithms can achieve low bit rates without compromising quality. However, the effectiveness of pitch-synchronous coding depends strongly on the ability to estimate the fundamental period of the speech signal precisely and reliably. We present a novel pitch postprocessing method that significantly improves the accuracy and reliability of pitch estimation. In contrast to classical schemes, the pitch is treated as a function that is continuous in both time and amplitude. B-spline signal processing, half-wave rectification, and multi-stage, multi-resolution optimization are essential parts of the procedure. The performance of the method is evaluated objectively and subjectively using the Waveform Interpolation coder. The objective results show that, for voiced segments, the method significantly decreases the energy of the unvoiced component estimate (by 60% on average) compared to using an unprocessed pitch. Listening tests show a 90% preference for speech generated using our postprocessor over speech generated using a conventional method.
  • Keywords
    Auditory system; Optimization; Speech; Spline
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type
    conf
  • DOI
    10.1109/ICASSP.2002.5743723
  • Filename
    5743723