DocumentCode :
2126102
Title :
Polish N-Grams and Their Correction Process
Author :
Ziólko, Bartosz ; Skurzok, Dawid ; Michalska, Malgorzata
Author_Institution :
Dept. of Electron., AGH Univ. of Sci. & Technol., Kraków, Poland
fYear :
2010
fDate :
11-13 Aug. 2010
Firstpage :
1
Lastpage :
5
Abstract :
Word n-gram statistics collected from over 1 300 000 000 words are presented. Eventhough they were collected from various good sources, they contain several types of errors. The paper focuses on the process of partly supervised correction of the n- grams. Types of errors are described as well as our software allowing efficient and fast corrections.
Keywords :
software engineering; speech recognition; Polish language; supervised correction; word n-gram statistic; Dictionaries; Electronic publishing; Encyclopedias; Internet; Software; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Ubiquitous Engineering (MUE), 2010 4th International Conference on
Conference_Location :
Cebu
Print_ISBN :
978-1-4244-7563-6
Type :
conf
DOI :
10.1109/MUE.2010.5575068
Filename :
5575068
Link To Document :
بازگشت