DocumentCode :
3206022
Title :
A New Document Masking Approach for Removing Confidential Information
Author :
Ikawa, Yohei ; Kanayama, Hiroshi
Author_Institution :
Tokyo Res. Lab., Tokyo
fYear :
2007
fDate :
23-26 July 2007
Firstpage :
107
Lastpage :
114
Abstract :
In order to protect confidential information such as personal and organizational information written as text, document masking techniques are becoming important. Such document masking methods extract humans, places, and organization names automatically and remove them, so they make documents harmless and allow sharing them safely within an organization, and contribute to improving productivity. However, existing automatic document masking techniques are not reliable enough since they may fail to mask out-of-vocabulary proper nouns. In this paper we propose a novel technique for document masking, the Unmasking Method, in which all of the words are hidden initially and a human specifies the non-confidential words to be unmasked. The proposed method is a high-safety document masking method since it unmasks only words that a human has manually recognized as safe. Our experimental results show its safety and effectiveness.
Keywords :
data privacy; feature extraction; text analysis; confidential information protection; document masking; feature extraction; text analysis; unmasking method; Data mining; Dictionaries; Humans; Laboratories; Productivity; Protection; Safety;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
E-Commerce Technology and the 4th IEEE International Conference on Enterprise Computing, E-Commerce, and E-Services, 2007. CEC/EEE 2007. The 9th IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7695-2913-5
Type :
conf
DOI :
10.1109/CEC-EEE.2007.8
Filename :
4285205
Link To Document :
بازگشت