• DocumentCode
    2977125
  • Title

    Detecting the Theft of Natural Language Text Using Birthmark

  • Author

    Yang, Jianlong ; Wang, Jianmin ; Li, Deyi

  • Author_Institution
    Tsinghua University, China
  • fYear
    2006
  • fDate
    Dec. 2006
  • Firstpage
    699
  • Lastpage
    702
  • Abstract
    To detect the theft of natural language text effectively, we present a novel scheme to derive birthmark from the text. Since birthmark is a unique and native characteristic of every text, a text with the same birthmark of another can be easily suspected of a copy. Ideally, birthmark should satisfy two properties: (a) credibility - independent texts must be distinguished by completely different birthmarks, and (b) resilience - birthmark should be tolerant against meaningpreserving attacks. To evaluate the effectiveness of the proposed birthmark, we conduct two experiments. The first one shows that birthmark successfully distinguishes non-copied files. In the second one, it shows that birthmark has quite good a tolerance against meaning-preserving attacks.
  • Keywords
    Data mining; Natural languages; Q measurement; Resilience; Signal processing; Software protection; Watermarking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Hiding and Multimedia Signal Processing, 2006. IIH-MSP '06. International Conference on
  • Conference_Location
    Pasadena, CA, USA
  • Print_ISBN
    0-7695-2745-0
  • Type

    conf

  • DOI
    10.1109/IIH-MSP.2006.265097
  • Filename
    4041817