• DocumentCode
    3367120
  • Title

    Application of Language Models to Suspect Prioritisation and Suspect Likelihood in Serial Crimes

  • Author

    Bache, Richard ; Crestani, Fabio ; Canter, David ; Youngs, Donna

  • Author_Institution
    Univ. of Strathclyde, Glasgow
  • fYear
    2007
  • fDate
    29-31 Aug. 2007
  • Firstpage
    399
  • Lastpage
    404
  • Abstract
    Language models are successfully applied to the problem of analysing crime descriptions from a police database with the purpose of prioritising suspects for an unsolved crime, given details of solved crimes. The frequency of terms in each description relates to the behaviour of the offender and this can be used to link crimes to a common offender. Language modelling uses Bayes´ theorem and thus require a prior probability. Such a prior can be based on each offender´s past propensity to offend, derived from historic data. Language modelling yields a probability of a document being relevant, which in this case is interpreted as the probability of a suspect being the culprit. Although the absolute value of the probability does not carry any direct applied implications, the study does show that the general likelihood of identification of the actual suspect does correspond to the relative values. Thus these probabilities can be used for more than just ranking suspects.
  • Keywords
    Bayes methods; natural languages; police data processing; probability; Bayes theorem; language model; police database; probability; serial crime description analysis; suspect likelihood; suspect prioritisation; Computer crime; Data analysis; Data security; Databases; Frequency; Information analysis; Information retrieval; Information science; Information security; Psychology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Assurance and Security, 2007. IAS 2007. Third International Symposium on
  • Conference_Location
    Manchester
  • Print_ISBN
    0-7695-2876-7
  • Electronic_ISBN
    978-0-7695-2876-2
  • Type

    conf

  • DOI
    10.1109/IAS.2007.58
  • Filename
    4299806