• DocumentCode
    3307283
  • Title
    Diachronic Corpus and Linguistic Space: New Methods for the Analysis of Language Change
  • Author
    Yamamoto, Hilofumi; Tanaka, Makiro; Kondo, Yasuhiro
  • Author_Institution
    Tokyo Inst. of Technol., Tokyo, Japan
  • fYear
    2012
  • fDate
    8-10 Aug. 2012
  • Firstpage
    381
  • Lastpage
    384
  • Abstract
    The design and development of a diachronic corpus of Japanese began in 2009 at the Department of Corpus Study, the National Institute of Japanese Language and Linguistics, Japan (NINJAL), as a collaborative research project by linguists and literature scholars of NINJAL and the University of Oxford. The corpus focuses on collecting representative Japanese literary works and classical documents from the tenth to the nineteenth century. We are currently developing a prototype version of the corpus, which involves selection of materials, digitization of texts, addition of alternative texts (containing different orthography) to original texts, compilation of a basic thesaurus that differentiates between different spellings, and word segmentation. This paper discusses the basic concepts encountered during our work on the project, namely synchronic and diachronic analysis, which led us to design a serial comparison model that allows us to examine language change between documents or literary works with respect to time.
  • Keywords
    linguistics; natural language processing; research and development; text analysis; NINJAL; University of Oxford; classical documents; collaborative research project; diachronic Japanese corpus; diachronic analysis; language change analysis; linguistic space; national institute of Japanese language and linguistics; representative Japanese literary works; synchronic analysis; thesaurus; word segmentation; Analytical models; Collaboration; Educational institutions; History; Materials; Pragmatics; Thesauri; diachronic; differential of lexical component; history of Japanese; language change; serial comparison model; synchronic;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking and Parallel & Distributed Computing (SNPD), 2012 13th ACIS International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4673-2120-4
  • Type
    conf
  • DOI
    10.1109/SNPD.2012.104
  • Filename
    6299310