Title :
Identification and extraction of topic elements for Chinese interactive text
Author :
Zhu, Haiping ; Yang, Yang ; Chen, Yan ; Yu, Xiaolu
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi´´an Jiaotong Univ., Xi´´an, China
Abstract :
This paper is based on identification and extraction of topic elements for Chinese interactive text. A topic segmentation method based on time sequence is achieved. Then a novel identification and extraction algorithm of topic elements is proposed. Firstly, noise filtering and Chinese word segmentation on the original corpus are executed. Secondly, the identifying and extracting method in group of mixed turn is used to extract the topic elements, such as time, place and figure. Finally, performance evaluation of identification recall, identification accuracy and extraction accuracy are achieved. The experimental results show the effectiveness of the algorithm.
Keywords :
feature extraction; natural language processing; text analysis; Chinese interactive text; Chinese word segmentation; extraction accuracy; figure extraction; identification accuracy; identification recall; noise filtering; place extraction; time extraction; time sequence; topic element extraction; topic element identification; topic segmentation method; Accuracy; Nickel; Satellites; Security; Semantics; XML; Interactive text; Topic elements; Topic elements identification and extraction; Topic segmentation;
Conference_Titel :
Multimedia Technology (ICMT), 2011 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-61284-771-9
DOI :
10.1109/ICMT.2011.6002010