• DocumentCode
    62562
  • Title

    Data Mining applied to Forensic Speaker Identification

  • Author

    Univaso, Pedro ; Ale, Juan Maria ; Gurlekian, Jorge A.

  • Author_Institution
    INIGEM, Lab. de Investig. Sensoriales, UBA, Buenos Aires, Argentina
  • Volume
    13
  • Issue
    4
  • fYear
    2015
  • fDate
    Apr-15
  • Firstpage
    1098
  • Lastpage
    1111
  • Abstract
    In this paper we analyze the advantages of using data mining techniques and tools for data fusion in forensic speaker recognition. Segmental and suprasegmental features were employed in 28 different classifiers, in order to compare their performances. The selected classifiers have different learning techniques: lazy or instance-based, eager and ensemble. Two approaches were employed on the classification task: the use of all features and the use of a feature subset, selected with a gain ratio methodology. The best performances, with all features, were obtained by three classifiers: Logistic Model Tree (eager), LogitBoost (ensemble) and Multilayer Perceptron (eager). Support Vector Machine (eager) proved to be a good classifier if a Pearson VII function-based universal kernel was used. When low dimensional features were selected, ensemble classifiers exceeded the performance of all others classifiers. Segmental and tone features demonstrated the best speaker discrimination capabilities, followed by duration and quality voice features. Evaluation was performed on Argentine-Spanish voice samples from the Speech_Dat database recorded on a fixed telephone environment. Different recording sessions and channels for the test segments were added and the Z-norm procedure was applied for channel compensation.
  • Keywords
    data mining; feature selection; learning (artificial intelligence); multilayer perceptrons; sensor fusion; signal classification; speaker recognition; support vector machines; Argentine-Spanish voice samples; LogitBoost; Pearson VII function-based universal kernel; Z-norm procedure; channel compensation; classification task; data fusion; data mining techniques; eager learning techniques; ensemble classifiers; ensemble learning techniques; fixed telephone environment; forensic speaker identification; forensic speaker recognition; gain ratio methodology; instance-based learning techniques; lazy learning techniques; logistic model tree; low dimensional feature selection; multilayer perceptron; segmental features; speech_Dat database; support vector machine; suprasegmental features; test segments; voice feature quality; Adaptation models; Bagging; Data mining; Forensics; Hidden Markov models; Logistics; Support vector machines; Classifiers; Data Fusion; Data Mining; Ensemble Methods; Speaker Recognition;
  • fLanguage
    English
  • Journal_Title
    Latin America Transactions, IEEE (Revista IEEE America Latina)
  • Publisher
    ieee
  • ISSN
    1548-0992
  • Type

    jour

  • DOI
    10.1109/TLA.2015.7106363
  • Filename
    7106363