DocumentCode :
1087378
Title :
A semiautomatic pitch detector (SAPD)
Author :
McGonegal, Carol A. ; Rabiner, Lawrence R. ; Rosenberg, Aaron E.
Author_Institution :
Bell Laboratories., Murray Hill, N.J.J
Volume :
23
Issue :
6
fYear :
1975
fDate :
12/1/1975 12:00:00 AM
Firstpage :
570
Lastpage :
574
Abstract :
The purpose of this paper is to describe a technique for semiautomatically determining the pitch contour of an utterance. The method is significantly more sophisticated than the standard technique of hand tracking of pitch periods from a waveform display of the utterance and leads to a fairly robust measurement of the pitch period. This technique utilizes a simultaneous display (on a 10 ms section-by-section basis) of the low-pass filtered waveform, the autocorrelation of a 400- point segment of the low-pass filtered waveform, and the cepstrum of the same 400-point segment of the wideband recording. For each of the separate displays (i.e., waveform, autocorrelation, and cepstrum) an independent estimate of the pitch period is made on an interactive basis with the computer, and the final pitch period decision is made by the user based on results of each of the measurements. The technique has been tested on a large number of utterances spoken by a variety of speakers with very good results. Formal tests of the method were made in which four people were asked to use the method on three different utterances, and their results were then compared. During voiced regions, the standard deviation in the value of the pitch period was about 0.5 samples across the four people. The standard deviation of the location of the time at which voiced regions became unvoiced, and vice versa was on the order of half a section duration, or 5 ms. The major limitation of the proposed method is that it requires about 30 min to analyze 1 s of speech. However, the increased accuracy and robustness of the results indicate that the tradeoff of time for accuracy is a good one for many applications.
Keywords :
Autocorrelation; Cepstrum; Computer displays; Detectors; Low pass filters; Measurement standards; Robustness; Speech analysis; Testing; Wideband;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1975.1162750
Filename :
1162750
Link To Document :
بازگشت