• DocumentCode
    1473193
  • Title

    An Ontology-Based Text-Mining Method to Cluster Proposals for Research Project Selection

  • Author

    Ma, Jian ; Xu, Wei ; Sun, Yong-hong ; Turban, Efraim ; Wang, Shouyang ; Liu, Ou

  • Author_Institution
    Dept. of Inf. Syst., City Univ. of Hong Kong, Kowloon, China
  • Volume
    42
  • Issue
    3
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    784
  • Lastpage
    790
  • Abstract
    Research project selection is an important task for government and private research funding agencies. When a large number of research proposals are received, it is common to group them according to their similarities in research disciplines. The grouped proposals are then assigned to the appropriate experts for peer review. Current methods for grouping proposals are based on manual matching of similar research discipline areas and/or keywords. However, the exact research discipline areas of the proposals cannot often be accurately designated by the applicants due to their subjective views and possible misinterpretations. Therefore, rich information in the proposals´ full text can be used effectively. Text-mining methods have been proposed to solve the problem by automatically classifying text documents, mainly in English. However, these methods have limitations when dealing with non-English language texts, e.g., Chinese research proposals. This paper presents a novel ontology-based text-mining approach to cluster research proposals based on their similarities in research areas. The method is efficient and effective for clustering research proposals with both English and Chinese texts. The method also includes an optimization model that considers applicants´ characteristics for balancing proposals by geographical regions. The proposed method is tested and validated based on the selection process at the National Natural Science Foundation of China. The results can also be used to improve the efficiency and effectiveness of research project selection processes in other government and private research funding agencies.
  • Keywords
    data mining; government data processing; ontologies (artificial intelligence); optimisation; pattern classification; pattern clustering; pattern matching; project management; research and development; text analysis; China; Chinese text; English; National Natural Science Foundation; government agency; grouping proposals; manual matching; nonEnglish language text; ontology-based text mining approach; optimization model; private research funding agency; research project selection process; research proposal clustering; text document classification; Clustering algorithms; Educational institutions; Humans; Ontologies; Proposals; Sun; Vectors; Clustering analysis; decision support systems; ontology; research project selection; text mining;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4427
  • Type

    jour

  • DOI
    10.1109/TSMCA.2011.2172205
  • Filename
    6171866