DocumentCode
130357
Title
Data cleansing of the fire & rescue text corpus. The case study of correction of the misspellings and segmentation into sentences
Author
Krenski, Karol ; Fliszkiewicz, Mateusz
Author_Institution
Sect. of Comput. Sci., Main Sch. of Fire Service, Warsaw, Poland
fYear
2014
fDate
7-10 Sept. 2014
Firstpage
331
Lastpage
335
Abstract
The article presents a case study of applying data cleansing methods and segmentation procedures in order to correct and enhance the structure of the domain corpus of fire service. During the study we present our approach and the results in the task of correcting the misspellings, as well as the method of segmenting the corpus into sentences.
Keywords
emergency services; fires; text analysis; data cleansing method; fire & rescue text corpus; fire service; misspelling correction; sentence segmentation procedure; Buildings; Context; Databases; Dictionaries; Fires; Semantics; Vocabulary; Data Cleansing; Fire Service; Misspellings; Segmentation; Text Corpus;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Systems (FedCSIS), 2014 Federated Conference on
Conference_Location
Warsaw
Type
conf
DOI
10.15439/2014F406
Filename
6933033
Link To Document