• DocumentCode
    1203241
  • Title

    Achieving Microaggregation for Secure Statistical Databases Using Fixed-Structure Partitioning-Based Learning Automata

  • Author

    Fayyoumi, Ebaa ; Oommen, John B.

  • Author_Institution
    Sch. of Comput. Sci., Carleton Univ., Ottawa, ON
  • Volume
    39
  • Issue
    5
  • fYear
    2009
  • Firstpage
    1192
  • Lastpage
    1205
  • Abstract
    We consider the microaggregation problem (MAP) that involves partitioning a set of individual records in a microdata file into a number of mutually exclusive and exhaustive groups. This problem, which seeks for the best partition of the microdata file, is known to be NP-hard and has been tackled using many heuristic solutions. In this paper, we present the first reported fixed-structure-stochastic-automata-based solution to this problem. The newly proposed method leads to a lower value of the information loss (IL), obtains a better tradeoff between the IL and the disclosure risk (DR) when compared with state-of-the-art methods, and leads to a superior value of the scoring index, which is a criterion involving a combination of the IL and the DR. The scheme has been implemented, tested, and evaluated for different real-life and simulated data sets. The results clearly demonstrate the applicability of learning automata to the MAP and its ability to yield a solution that obtains the best tradeoff between IL and DR when compared with the state of the art.
  • Keywords
    automata theory; learning (artificial intelligence); statistical databases; NP-hard; disclosure risk; fixed structure partitioning; fixed structure stochastic automata; information loss; learning automata; microaggregation problem; microdata file partitioning; records partitioning; scoring index; secure statistical databases; Disclosure risk (DR); information loss (IL); microaggregation technique (MAT); object migrating microaggregated automaton (OMMA); Algorithms; Artificial Intelligence; Computer Security; Computer Simulation; Database Management Systems; Databases, Factual; Models, Statistical; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2009.2013723
  • Filename
    4804703