• DocumentCode
    3589933
  • Title

    On the path to mine multi word expressions from Slovak web space

  • Author

    Telepovska, H. ; Baco, M. ; Genci, J. ; Olostiak, M.

  • Author_Institution
    Tech. Univ. of Kosice, Kosice, Slovakia
  • fYear
    2014
  • Firstpage
    133
  • Lastpage
    138
  • Abstract
    The paper presents the current ongoing project aimed at documenting static and dynamic characteristics of the Slovak website. The aim of the first part of the project is to elaborate statistics regarding number of second level domains in Slovak web space. Other additional information about each domain have been processed such as determining whether the domain is functional or is dead, recording the relevant IP address, the effort to determine the period of change content domain, etc. Continuously generated data, moreover, allow presenting the dynamics of changes in the number of domains, or some attributes. In the second part of the project we plan to focus on getting static and dynamic characteristics of the Slovak vocabulary - mapping the current vocabulary, watching new words or phrases and so on.
  • Keywords
    IP networks; Web sites; data mining; vocabulary; IP address; Slovak Web space; Slovak vocabulary mapping; Slovak website; change dynamics; dynamic characteristics; multiword expressions mining; static characteristics; Conferences; Databases; Dictionaries; Electronic learning; Partitioning algorithms; Vocabulary; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging eLearning Technologies and Applications (ICETA), 2014 IEEE 12th International Conference on
  • Print_ISBN
    978-1-4799-7739-0
  • Type

    conf

  • DOI
    10.1109/ICETA.2014.7107559
  • Filename
    7107559