• DocumentCode
    79538
  • Title

    Distributing Recognition in Computational Paralinguistics

  • Author

    Zixing Zhang ; Coutinho, Eduardo ; Jun Deng ; Schuller, Bjorn

  • Author_Institution
    Machine Intell. & Signal Process. Group, Tech. Univ. Munchen, Munich, Germany
  • Volume
    5
  • Issue
    4
  • fYear
    2014
  • fDate
    Oct.-Dec. 1 2014
  • Firstpage
    406
  • Lastpage
    417
  • Abstract
    In this paper, we propose and evaluate a distributed system for multiple Computational Paralinguistics tasks in a client-server architecture. The client side deals with feature extraction, compression, and bit-stream formatting, while the server side performs the reverse process, plus model training, and classification. The proposed architecture favors large-scale data collection and continuous model updating, personal information protection, and transmission bandwidth optimization. In order to preliminarily investigate the feasibility and reliability of the proposed system, we focus on the trade-off between transmission bandwidth and recognition accuracy. We conduct large-scale evaluations of some key functions, namely, feature compression/decompression, model training and classification, on five common paralinguistic tasks related to emotion, intoxication, pathology, age and gender. We show that, for most tasks, with compression ratios up to 40 (bandwidth savings up to 97.5 percent), the recognition accuracies are very close to the baselines. Our results encourage future exploitation of the system proposed in this paper, and demonstrate that we are not far from the creation of robust distributed multi-task paralinguistic recognition systems which can be applied to a myriad of everyday life scenarios.
  • Keywords
    client-server systems; computational linguistics; data compression; pattern classification; bit-stream formatting; client-server architecture; computational paralinguistics; continuous model updating; data classification; data collection; data compression; distributed system; feature extraction; multitask paralinguistic recognition systems; personal information protection; plus model training; reverse process; transmission bandwidth optimization; Bandwidth; Computational modeling; Distributed processing; Feature extraction; Linguistics; Speech coding; Speech recognition; Computational paralinguistics; distributed recognition system; emotion; split vector quantization;
  • fLanguage
    English
  • Journal_Title
    Affective Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1949-3045
  • Type

    jour

  • DOI
    10.1109/TAFFC.2014.2359655
  • Filename
    6906228