• DocumentCode
    1532897
  • Title

    The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches

  • Author

    Weninger, Felix ; Krajewski, Jarek ; Batliner, Anton ; Schuller, Bjorn

  • Author_Institution
    Inst. for HumanMachine Commun., Tech. Univ. Munchen, Munich, Germany
  • Volume
    3
  • Issue
    4
  • fYear
    2012
  • Firstpage
    496
  • Lastpage
    508
  • Abstract
    We introduce the automatic determination of leadership emergence by acoustic and linguistic features in online speeches. Full realism is provided by the varying and challenging acoustic conditions of the presented YouTube corpus of online available speeches labeled by 10 raters and by processing that includes Long Short-Term Memory-based robust voice activity detection (VAD) and automatic speech recognition (ASR) prior to feature extraction. We discuss cluster-preserving scaling of 10 original dimensions for discrete and continuous task modeling, ground truth establishment, and appropriate feature extraction for this novel speaker trait analysis paradigm. In extensive classification and regression runs, different temporal chunkings and optimal late fusion strategies (LFSs) of feature streams are presented. In the result, achievers, charismatic speakers, and teamplayers can be recognized significantly above chance level, reaching up to 72.5 percent accuracy on unseen test data.
  • Keywords
    feature extraction; speaker recognition; YouTube corpus; automatic analysis; automatic determination; automatic speech recognition; charismatic speakers; chunkings; cluster preserving scaling; continuous task modeling; feature extraction; full realism; leadership; long short term memory based robust voice activity detection; online speeches; optimal late fusion strategy; speaker trait analysis paradigm; Acoustics; Ethics; Linguistics; Pragmatics; Speech recognition; Training; YouTube; Personality analysis; acoustic/linguistic fusion; dimensional analysis;
  • fLanguage
    English
  • Journal_Title
    Affective Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1949-3045
  • Type

    jour

  • DOI
    10.1109/T-AFFC.2012.15
  • Filename
    6212433