مرکز منطقه ای اطلاع رساني علوم و فناوري - A Statistical Language Modeling Approach to Online Deception Detection

DocumentCode :

945687

Title :

A Statistical Language Modeling Approach to Online Deception Detection

Author :

Zhou, Lina ; Shi, Yongmei ; Zhang, Dongsong

Author_Institution :

Dept. of Inf. Syst., Univ. of Maryland, Baltimore, MD

Volume :

Issue :

fYear :

2008

Firstpage :

1077

Lastpage :

1081

Abstract :

Online deception is disrupting our daily life, organizational process, and even national security. Existing approaches to online deception detection follow a traditional paradigm by using a set of cues as antecedents for deception detection, which may be hindered by ineffective cue identification. Motivated by the strength of statistical language models (SLMs) in capturing the dependency of words in text without explicit feature extraction, we developed SLMs to detect online deception. We also addressed the data sparsity problem in building SLMs in general and in deception detection in specific using smoothing and vocabulary pruning techniques. The developed SLMs were evaluated empirically with diverse datasets. The results showed that the proposed SLM approach to deception detection outperformed a state-of-the-art text categorization method as well as traditional feature-based methods.

Keywords :

data mining; learning (artificial intelligence); programming languages; security of data; cue identification; data sparsity problem; national security; online deception detection; organizational process; smoothing-vocabulary pruning techniques; statistical language modeling approach; Machine learning; classification; knowledge management applications; language models; security; text mining;

fLanguage :

English

Journal_Title :

Knowledge and Data Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1041-4347

Type :

jour

DOI :

10.1109/TKDE.2007.190624

Filename :

4358936

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=945687