DocumentCode :
130357
Title :
Data cleansing of the fire & rescue text corpus. The case study of correction of the misspellings and segmentation into sentences
Author :
Krenski, Karol ; Fliszkiewicz, Mateusz
Author_Institution :
Sect. of Comput. Sci., Main Sch. of Fire Service, Warsaw, Poland
fYear :
2014
fDate :
7-10 Sept. 2014
Firstpage :
331
Lastpage :
335
Abstract :
The article presents a case study in applying data cleansing methods and segmentation procedures to correct and improve the structure of a domain corpus of the fire service. We present our approach to correcting misspellings and the results obtained, as well as a method for segmenting the corpus into sentences.
Keywords :
emergency services; fires; text analysis; data cleansing method; fire & rescue text corpus; fire service; misspelling correction; sentence segmentation procedure; Buildings; Context; Databases; Dictionaries; Fires; Semantics; Vocabulary; Data Cleansing; Fire Service; Misspellings; Segmentation; Text Corpus;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Science and Information Systems (FedCSIS), 2014 Federated Conference on
Conference_Location :
Warsaw
Type :
conf
DOI :
10.15439/2014F406
Filename :
6933033