DocumentCode :
723477
Title :
LaNCoA: A Python toolkit for Language Networks Construction and Analysis
Author :
Margan, Domagoj ; Mestrovic, Ana
Author_Institution :
Dept. of Inf., Univ. of Rijeka, Rijeka, Croatia
fYear :
2015
fDate :
25-29 May 2015
Firstpage :
1628
Lastpage :
1633
Abstract :
In this paper we describe LaNCoA, a Language Networks Construction and Analysis toolkit implemented in Python. The toolkit provides procedures for constructing networks from text at the word level (co-occurrence networks, syntactic networks, shuffled networks) and at the subword level (syllable networks, grapheme networks). It also implements functions for analyzing language networks at the global and local levels. The toolkit is organized into several modules that support different aspects of language analysis: analysis of global network measures for different co-occurrence window sizes, comparison of networks built from original and shuffled texts, comparison of networks constructed at different language levels, etc. Text manipulation methods such as corpus cleaning, lemmatization, and stopword removal are also implemented. For the basic network representation we use the functions and methods available in NetworkX. However, language network analysis has specific requirements that call for additional functions and methods, which was the main motivation for this research.
Keywords :
programming languages; LaNCoA; NetworkX functions; cooccurrence networks; cooccurrence window; corpora cleaning; global level; grapheme networks; language levels; language networks construction; lemmatization; local level; python toolkit; shuffled networks; stopwords removal; subword-level; syllable networks; syntactic networks; text manipulation methods; word-level; Cleaning; Complex networks; Pragmatics; Semantics; Standards; Syntactics; Weight measurement;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2015 38th International Convention on
Conference_Location :
Opatija
Type :
conf
DOI :
10.1109/MIPRO.2015.7160532
Filename :
7160532