DocumentCode
388117
Title
A new system for reliable pitch extraction of speech
Author
Fujisaki, Hiroya ; Hirose, Keikichi ; Shimizu, Keisuke
Author_Institution
University of Tokyo, Bunkyo-ku, Tokyo, Japan
Volume
12
fYear
1987
fDate
31868
Firstpage
2422
Lastpage
2425
Abstract
The causes of errors in conventional pitch extraction methods can be classified into two categories: 1) extrinsic factors that come from the analysis methods, such as imperfect signal representation due to inappropriate shape, width, and placement of the analysis window, etc., and 2) intrinsic factors that reside in the speech signal itself, such as the occurrence of a strong harmonic component or a sub-harmonic component, etc. In fact, fixed frame size and frame shift, adopted in most of the conventional systems, are responsible for a considerable part of their gross pitch errors. In this paper we combine the use of running waveform analysis, an exponential analysis window, and a variable frame shift to cope with errors of the first category. Various new methods are also introduced to cope with causes of errors of the second category. The validity of the proposed methods is confirmed by experiments using speech materials from both male and female speakers.
Keywords
Data mining; Harmonic analysis; Proposals; Reliability engineering; Shape; Signal analysis; Signal representations; Speech analysis; Speech coding; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type
conf
DOI
10.1109/ICASSP.1987.1169926
Filename
1169926
Link To Document