DocumentCode :
3700259
Title :
Finding domain-specific termsfrom search engine´s query logs
Author :
Weijian Ni;Tong Liu;Qingtian Zeng
Author_Institution :
College of Information Science and Engineering, Shandong University of Science and Technology, Qingdao 266510, China
Volume :
1
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
301
Lastpage :
308
Abstract :
Automatic domain-specific terms recognition is a basic step for various natural language processing applications. As for most traditional approaches, a domain-specific corpus of high quality and coverage needs to be available in advance. This paper aims to find domain-specific termsfrom a type of general corpus, i.e., search engine´s query logs, which is of higher availability, coverage and timeliness than domain-specific corpora. In the proposed approach, the problem of automatic domain-specific query recognition is formulated as a supervised learning task, where feature representations of every query candidate are derived according to inherent structure of query logs. In addition, an under-sampling technique is employed to solve class-imbalance problem in the supervised learning task. By experimental evaluationon real query logs from a commercial search engine, the result demonstrates the advantages of the proposed approach.
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICMLC.2015.7340939
Filename :
7340939
Link To Document :
بازگشت