DocumentCode
1780683
Title
Use of language as a cognitive biometric trait
Author
Pokhriyal, Neeti ; Nwogu, Ifeoma ; Govindaraju, Vengatesan
Author_Institution
Dept. of Comput. Sci., Univ. at Buffalo, New York, NY, USA
fYear
2014
fDate
Sept. 29 2014-Oct. 2 2014
Firstpage
1
Lastpage
8
Abstract
This paper investigates whether the cognitive state of a person can be learnt and used as a novel biometric trait. We explore the idea of using the language written by an author as his/her cognitive fingerprint. The dataset consists of millions of blogs written by thousands of authors on the Internet. Our proposed method learns a classifier that can distinguish between genuine and impostor authors. Our results are encouraging (we report an area under the ROC curve of 72%) and show that users do have a distinctive linguistic style, which is evident even when analyzing a corpus as large and diverse as the Internet. When we tested on new authors that the system had never encountered before, our methodology correctly identified genuine authors with 78% accuracy and impostors with 76% accuracy.
Keywords
Web sites; biometrics (access control); cognition; computational linguistics; learning (artificial intelligence); pattern classification; sensitivity analysis; area under the ROC curve; author blogs; classifier learning; cognitive biometric trait; cognitive fingerprint; corpora analysis; genuine authors; impostor authors; linguistic style; person cognitive state; Blogs; Feature extraction; Internet; Vectors; Vocabulary; Writing; XML
fLanguage
English
Publisher
IEEE
Conference_Titel
Biometrics (IJCB), 2014 IEEE International Joint Conference on
Conference_Location
Clearwater, FL
Type
conf
DOI
10.1109/BTAS.2014.6996291
Filename
6996291