• DocumentCode
    130357
  • Title

    Data cleansing of the fire & rescue text corpus. The case study of correction of the misspellings and segmentation into sentences

  • Author

    Krenski, Karol ; Fliszkiewicz, Mateusz

  • Author_Institution
    Sect. of Comput. Sci., Main Sch. of Fire Service, Warsaw, Poland
  • fYear
    2014
  • fDate
    7-10 Sept. 2014
  • Firstpage
    331
  • Lastpage
    335
  • Abstract
    The article presents a case study of applying data cleansing methods and segmentation procedures in order to correct and enhance the structure of the domain corpus of fire service. During the study we present our approach and the results in the task of correcting the misspellings, as well as the method of segmenting the corpus into sentences.
  • Keywords
    emergency services; fires; text analysis; data cleansing method; fire & rescue text corpus; fire service; misspelling correction; sentence segmentation procedure; Buildings; Context; Databases; Dictionaries; Fires; Semantics; Vocabulary; Data Cleansing; Fire Service; Misspellings; Segmentation; Text Corpus;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Systems (FedCSIS), 2014 Federated Conference on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.15439/2014F406
  • Filename
    6933033