Title :
A data quality improvement method based on non-word errors correction
Author :
Keting Yin ; Shan Wang ; Zirui Liu ; Qi Yu ; Bo Zhou
Author_Institution :
Sch. of Software Technol., Zhejiang Univ., Ningbo, China
Abstract :
Spelling errors of data entry is an important factor which influences banking data quality. Based on banking information system, we study non-word spelling errors occurring in the process of typing in with keyboard. The fingering of keyboarders and QWERTY international keyboard layout will be taken into account in the division of the letters. 26 letters will be divided into 5 types according to fingering and keyboard partitions, and accordingly, a mathematical model based on keyboard probability will be proposed. Combined with effective labeling and sequencing methods of erroneous words, this model will further lead to a recommended list of misspelled words. Case study based on this model is carried out and a process of correcting the nonword spelling errors is demonstrated. The study shows that the method proposed in this paper will effectively produce the recommended list of misspelled words and improve the quality of data entry.
Keywords :
bank data processing; keyboards; spelling aids; text analysis; QWERTY international keyboard layout; banking data quality; banking information system; data entry; data quality improvement method; erroneous words; keyboard partitions; keyboard probability; keyboarders fingering; labeling methods; mathematical model; misspelled words; nonword errors correction; nonword spelling errors; sequencing methods; typing; Banking; Benchmark testing; Fingers; Keyboards; Law; Mathematical model; data quality; non-word spelling errors; spelling error correction;
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-3278-8
DOI :
10.1109/ICSESS.2014.6933721