• DocumentCode
    3457701
  • Title

    Knowledge discovery in scientific databases using text mining and social network analysis

  • Author

    Jalalimanesh, A.

  • Author_Institution
    Inf. Eng. Dept., Iranian Res. Inst. for Inf. Sci. & Technol., Tehran, Iran
  • fYear
    2012
  • fDate
    23-26 Sept. 2012
  • Firstpage
    46
  • Lastpage
    49
  • Abstract
    This paper introduces a novel methodology to extract core concepts from text corpus. This methodology is based on text mining and social network analysis. At the text mining phase the keywords are extracted by tokenizing, removing stop-lists and generating N-grams. Network analysis phase includes co-word occurrence extraction, network representation of linked terms and calculating centrality measure. We applied our methodology on a text corpus including 650 thesis titles in the domain of Industrial engineering. Interpreting enriched networks was interesting and gave us valuable knowledge about corpus content.
  • Keywords
    data mining; database management systems; information retrieval; natural sciences computing; social networking (online); text analysis; N-gram; calculating centrality measure; corpus content; coword occurrence extraction; industrial engineering; keyword extraction; knowledge discovery; network representation; scientific database; social network analysis; text corpus; text mining; tokenization; Maintenance engineering; Visualization; Industrial engineering; Knowledge discovery; Social network analysis; Text mining; concept mapping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control, Systems & Industrial Informatics (ICCSII), 2012 IEEE Conference on
  • Conference_Location
    Bandung
  • Print_ISBN
    978-1-4673-1022-2
  • Type

    conf

  • DOI
    10.1109/CCSII.2012.6470471
  • Filename
    6470471