• DocumentCode
    2301442
  • Title

    Discriminative Training of GMM via Log-Likelihood Ratio for Abnormal Acoustic Event Classification in Vehicular Environment

  • Author

    Kim, Kwangyoun ; Ko, Hanseok

  • Author_Institution
    Sch. of Electr. Eng., Korea Univ., Seoul, South Korea
  • fYear
    2011
  • fDate
    23-25 May 2011
  • Firstpage
    348
  • Lastpage
    352
  • Abstract
    In this paper, a discriminative training technique based on Gaussian Mixture Model (GMM) is proposed for detection and classification of abnormal acoustic events in indoor environment. In particular, we consider small indoor space such as vehicular scenes and develop a two-step procedure in which statistical mapping of acoustic features is followed by abnormal event detection. In the first step, Mel-Frequency Cepstral Coefficients (MFCC) feature set is used to construct a Gaussian Mixture Model (GMM) for acoustic event mapping and log-likelihood ratio is used for confidence measure to correct misrecognition over vocal/nonvocal regions. In the 2nd step, an abnormal event is determined using maximum likelihood estimation approach wherein the ratio of abnormal events to cumulative events during an analysis window is compared to a threshold. For performance evaluation, we employ a statistically meaningful database of normal and abnormal acoustic events in actual indoor scenes of two representative scenarios. Subsequent experiments demonstrate a performance of 91% correct detection rate for abnormal context and 2.5% of error detection rate, which indicates it promising for real world vehicular acoustic surveillance applications.
  • Keywords
    Gaussian processes; acoustic signal detection; maximum likelihood estimation; surveillance; vehicles; GMM; Gaussian mixture model; MFCC feature set; abnormal acoustic event classification; abnormal acoustic event detection; acoustic event mapping; discriminative training technique; indoor environment; log-likelihood ratio; maximum likelihood estimation; mel-frequency cepstral coefficients; statistical mapping; vehicular environment; Context; Databases; Feature extraction; Mel frequency cepstral coefficient; Speech recognition; Vehicles; -context awareness; Acoustic-based surveillance; GMM; MFCC;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computers, Networks, Systems and Industrial Engineering (CNSI), 2011 First ACIS/JNU International Conference on
  • Conference_Location
    Jeju Island
  • Print_ISBN
    978-1-4577-0180-1
  • Type

    conf

  • DOI
    10.1109/CNSI.2011.39
  • Filename
    5954340