DocumentCode
1908376
Title
Categorization and Identification of Fragments with Shi Plus Punctuation
Author
Guonian Wang ; Lin He
Author_Institution
Sch. of Foreign Languages, China Univ. of Geosci., Wuhan, China
fYear
2013
fDate
17-19 Aug. 2013
Firstpage
47
Lastpage
50
Abstract
Studies of Chinese sentences with shi (是) as predicate have been highly fruitful from the perspectives of syntax, semantics and pragmatics. In a broader sense, however, a large number of sentences in which shi fills other syntactic roles (adverb, conjunction, auxiliary and even interjection) are in practical use and stand as barriers to natural language processing (NLP) and machine translation (MT). The special fragments consisting of shi plus punctuation are divided into "shi plus comma" and "comma plus shi", which are examined and discussed by means of corpora, illustrations and comparison. Two exceptional fragments are also briefly described to improve the precision of computer identification of these shi-plus-punctuation fragments.
Keywords
language translation; natural language processing; Chinese sentences; MT; NLP; adverb; auxiliary; comma plus shi; computer identification; conjunction; fragment categorization; fragment identification; interjection; machine translation; pragmatics; semantics; shi plus comma; shi plus punctuation fragment; syntax; Computers; Dictionaries; Educational institutions; Europe; Syntactics; Shi-fragments; shi-sentences
fLanguage
English
Publisher
IEEE
Conference_Titel
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location
Urumqi
Type
conf
DOI
10.1109/IALP.2013.16
Filename
6646000
Link To Document