• DocumentCode
    665682
  • Title

    Automated detection of anomalous shipping manifests to identify illicit trade

  • Author

    Sanfilippo, Antonio ; Chikkagoudar, Satish

  • Author_Institution
    Comput. & Stat. Div., Pacific Northwest Nat. Lab., Richland, WA, USA
  • fYear
    2013
  • fDate
    12-14 Nov. 2013
  • Firstpage
    529
  • Lastpage
    534
  • Abstract
    We describe an approach to analyzing anomalies in trade data based on the identification of cluster outliers. The approach uses unsupervised machine learning methods to discover semantically coherent clusters of shipping records in large collections of trade data. Trade data with cluster annotations are then used as input to a supervised machine learning algorithm to train and evaluate a classification model capable of identifying members of each cluster. The evaluation of this classification model provides an assessment of cluster coherence. Outliers are identified for each cluster by measuring the Euclidean distance from each member of the cluster to the cluster centroid, and then selecting a percentile threshold to identify shipping records with extreme distances from the cluster centroid. We describe a specific application of this approach to a dataset of 2.36M records for containerized shipments, with specific reference to the detection of anomalies potentially related to nuclear smuggling. Results show that this approach succeeds in finding semantically coherent clusters of shipping records, and identifying outliers that may help facilitate the detection of illicit trade.
  • Keywords
    business data processing; data analysis; pattern classification; pattern clustering; unsupervised learning; anomalous shipping manifests detection; classification model; cluster centroid; cluster outliers identification; data analysis; illicit trade identification; nuclear smuggling; percentile threshold; shipping records; supervised machine learning algorithm; trade data anomalies; unsupervised machine learning methods; Clustering algorithms; Earth; Euclidean distance; Inspection; Materials; Ores; Rocks; classification; clustering; detection of radiological threat materials; illicit trafficking; nuclear smuggling; trade data; visual analytics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technologies for Homeland Security (HST), 2013 IEEE International Conference on
  • Conference_Location
    Waltham, MA
  • Print_ISBN
    978-1-4799-3963-3
  • Type

    conf

  • DOI
    10.1109/THS.2013.6699059
  • Filename
    6699059