Title :
On the occurrences of two successive words in a literary Romanian corpus
Author :
Mitrea, Adrian ; Vlad, Adriana ; Luca, Adrian
Author_Institution :
Fac. of Electron., Telecommun. & Inf. Technol., Politeh. Univ. of Bucharest, Bucharest, Romania
Abstract :
The goal of the study was to investigate the statistical structure of groups of two successive words (diagrams of words) in the literary field of printed Romanian. The paper brings into discussion the statistical effect that a preceding word has over the words following it. All the investigation presented here refers to natural language as a chain of words.
Keywords :
natural language processing; statistical analysis; word processing; literary Romanian corpus; natural language; statistical structure; Artificial intelligence; Books; Colon; Dictionaries; Frequency; Information technology; Natural languages; Probability; literary corpus linguistics; statistical structure of two successive words;
Conference_Titel :
Communications (COMM), 2010 8th International Conference on
Conference_Location :
Bucharest
Print_ISBN :
978-1-4244-6360-2
DOI :
10.1109/ICCOMM.2010.5509036