• DocumentCode
    151825
  • Title

    Archaisms and neologisms identification in texts

  • Author

    Costin-Gabriel, Chiru ; Rebedea, Traian Eugen

  • Author_Institution
    Comput. Sci. Dept., Politeh. Univ. of Bucharest, Bucharest, Romania
  • fYear
    2014
  • fDate
    11-13 Sept. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper we present an application for identifying archaisms and neologisms in texts. The application also provides the ability to view graphically the evolution trends of these words for a better interpretation of the results. The presented solution consists of two phases: the learning phase in which we identify the general evolution trends of three categories of words (archaisms, neologisms and common words) and the classification phase in which we label new words with their corresponding category. For both phases, the application requires Internet access because it is using the Google Books N-gram Viewer to generate the images that back up the decisions.
  • Keywords
    Internet; natural language processing; pattern classification; text analysis; Google books n-gram viewer; Internet access; archaisms identification; classification phase; learning phase; natural language processing; neologisms identification; text mining; Dictionaries; Google; Market research; Natural language processing; Principal component analysis; Standards; Transforms; NLP; PCA; archaisms; neologisms; text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    RoEduNet Conference 13th Edition: Networking in Education and Research Joint Event RENAM 8th Conference, 2014
  • Conference_Location
    Chisinau
  • ISSN
    2068-1038
  • Print_ISBN
    978-1-4799-6860-2
  • Type

    conf

  • DOI
    10.1109/RoEduNet-RENAM.2014.6955312
  • Filename
    6955312