DocumentCode :
3449661
Title :
A Method for Collecting Uighur Websites
Author :
Wang Zhijuan
Author_Institution :
Nat. Language Resource Monitoring & Res. Center Beijing, Minzu Univ. of China, Beijing, China
fYear :
2013
fDate :
1-3 Nov. 2013
Firstpage :
260
Lastpage :
263
Abstract :
The URLs of Uighur Web sites are complex, which makes Uighur Web sites difficult to collect. First, the features of Uighur Web sites are analyzed. Then a method for collecting them is introduced in three steps: collect Web pages that may be in Uighur; judge whether each page is in Uighur; and derive the URL of the Uighur Web site from the URL of the Uighur Web page. Using this method, about 1,000 Uighur Web sites were collected.
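The last two steps named in the abstract (judging a page's language, then deriving a site URL from a page URL) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the paper's procedure: the marker-letter heuristic, both thresholds, and the names looks_uighur, site_root and collect_sites are hypothetical, and treating scheme plus host as the site URL is a simplification the abstract itself suggests may not always hold. Other Arabic-script languages that share some of these letters may be misclassified.

```python
from urllib.parse import urlsplit

# Assumption: letters used in Uighur Arabic-script text but absent from
# standard Arabic (illustrative set, not taken from the paper).
UIGHUR_MARKERS = set("\u06d5\u06ad\u06c7\u06c6\u06c8\u06cb\u06d0\u06be")


def looks_uighur(text, min_arabic_ratio=0.5, min_marker_ratio=0.05):
    """Heuristic: mostly Arabic-block letters, with enough marker letters.

    Both thresholds are arbitrary illustrative values.
    """
    letters = [c for c in text if c.isalpha()]
    if not letters:
        return False
    arabic = [c for c in letters if "\u0600" <= c <= "\u06ff"]
    markers = [c for c in arabic if c in UIGHUR_MARKERS]
    return (len(arabic) / len(letters) >= min_arabic_ratio
            and len(markers) / len(letters) >= min_marker_ratio)


def site_root(page_url):
    """Assumption: the site URL is the scheme and host of the page URL."""
    parts = urlsplit(page_url)
    return f"{parts.scheme}://{parts.netloc}/"


def collect_sites(pages):
    """pages: iterable of (url, text) pairs for already-fetched candidate pages.

    Returns the sorted, de-duplicated site roots of pages judged to be Uighur.
    """
    return sorted({site_root(url) for url, text in pages if looks_uighur(text)})
```

Fetching the candidate pages (the first step) is left out; collect_sites expects page text that has already been downloaded and decoded.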
Keywords :
Web sites; search engines; URL; Uighur Web page collection; Uighur Web site collection; Uighur Web site feature analysis; Educational institutions; HTML; Internet; Standards; Web pages; Uighur websites; web page collecting; web page language
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Intelligent Networks and Intelligent Systems (ICINIS), 2013 6th International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-1-4799-2808-8
Type :
conf
DOI :
10.1109/ICINIS.2013.73
Filename :
6754722