• DocumentCode
    699196
  • Title

    Several features for discrimination between vocal sounds and other environmental sounds

  • Author

    Shi Yuan-Yuan ; Wen Xue ; She Bin

  • Author_Institution
    HCI Lab., Samsung Adv. Inst. of Technol., Yongin, South Korea
  • fYear
    2004
  • fDate
    6-10 Sept. 2004
  • Firstpage
    2099
  • Lastpage
    2102
  • Abstract
    Several features are found to discriminate between the vocals sounds and other environmental sounds. The vocal sounds include speaking, laughter, etc., 23 kinds of nonverbal and verbal sounds; and the environmental sounds are recorded in domestic environments. The discriminative features are selected from 22 kinds of features. They are the speech recognition features of LPCC and MFCC, time-spectral features from FFT, statistics of pitch values and contour, ratio of voiced and unvoiced segments, and spectrum of pitch contour. The 9 features calculated from pitch contours perform much better than the features calculated from spectrums, which show no discriminability. The classification is performed simply by a neural network to evaluate the performance of the 9 features. They are tested on a 21 CDs environmental sound database. And the hit rate of 9 8.7 3% with the false alarm rate of 11 % are obtained. The classification result confirms the effectiveness and efficiency of the features.
  • Keywords
    fast Fourier transforms; feature extraction; neural nets; speech recognition; FFT; LPCC; MFCC; environmental sounds; neural network; pitch contours; speech recognition features; time-spectral features; vocal sounds; Abstracts; Artificial neural networks; Inductors; Manganese; Mel frequency cepstral coefficient; Pediatrics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2004 12th European
  • Conference_Location
    Vienna
  • Print_ISBN
    978-320-0001-65-7
  • Type

    conf

  • Filename
    7079726