• DocumentCode
    388117
  • Title

    A new system for reliable pitch extraction of speech

  • Author

    Fujisaki, Hiroya ; Hirose, Keikichi ; Shimizu, Keisuke

  • Author_Institution
    University of Tokyo, Bunkyo-ku, Tokyo, Japan
  • Volume
    12
  • fYear
    1987
  • fDate
    31868
  • Firstpage
    2422
  • Lastpage
    2425
  • Abstract
    The causes of errors in conventional pitch extraction methods can be classified into two categories: 1) extrinsic factors that come from the analysis methods, such as imperfect signal representation due to inappropriate shape, width, and placement of the analysis window, etc., and 2) intrinsic factors that reside in the speech signal itself, such as the occurrence of a strong harmonic component or a sub-harmonic component, etc. In fact, fixed frame size and frame shift, adopted in most of the conventional systems, are responsible for a considerable part of their gross pitch errors. In this paper we combine the use of running waveform analysis, an exponential analysis window, and a variable frame shift to cope with errors of the first category. Various new methods are also introduced to cope with causes of errors of the second category. The validity of the proposed methods is confirmed by experiments using speech materials from both male and female speakers.
  • Keywords
    Data mining; Harmonic analysis; Proposals; Reliability engineering; Shape; Signal analysis; Signal representations; Speech analysis; Speech coding; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1987.1169926
  • Filename
    1169926