Title :
Disambiguate Chinese personal pronoun based on semantic structure
Author :
Wei, Xiangfeng ; Zang, Hanfen ; Zhang, Quan
Author_Institution :
Inst. of Acoust., Chinese Acad. of Sci., Beijing
Abstract :
It is a very difficult problem in natural language processing to resolve the ambiguity of personal pronouns anaphora in a sentence or paragraph by computer according to semantic expression. Firstly, this paper focuses on finding out the personal names based on the maximal entropy model. Secondly, it categorizes Semantic Chunks and Sentence Category(SCs) based on the HNC theory. Thirdly, we chose 40 paragraphs in 2004 Athens Olympics as training corpus to make up the personal pronouns disambiguating rules. Finally, we chose another 40 paragraphs to exam the rules and processing steps by simulating computerpsilas processing manually. We got a very high precision. Therefore, based on parallel semantic structure of HNC, the approach is effective for the disambiguating of Chinese personal pronouns.
Keywords :
computational linguistics; maximum entropy methods; natural language processing; HNC theory; disambiguate Chinese personal pronoun anaphora; maximal entropy model; natural language processing; parallel semantic structure expression; semantic chunk; sentence category; Acoustics; Appraisal; Character recognition; Computational modeling; Computer simulation; Data mining; Entropy; Natural language processing; Robustness; Statistical distributions;
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
DOI :
10.1109/GRC.2008.4664717