Exploring features for automatic identification of news queries through query logs

Source: Chinese Journal of Library and Information Science (中国文献情报, English edition)
Purpose: Existing research on predicting queries with news intent has extracted classification features from external knowledge bases. This paper shows how features extracted from query logs can be applied to the automatic identification of news queries without using any external resources.

Design/methodology/approach: First, we manually labeled 1,220 news queries from Sogou.com. Based on an analysis of these queries, we then identified three features of news queries in terms of query content, time of query occurrence, and user click behavior. Next, we used 12 effective features proposed in the literature as a baseline and conducted experiments with a support vector machine (SVM) classifier. Finally, we compared the impact of the features used in this paper on the identification of news queries.

Findings: Compared with the baseline features, the F-score improved from 0.6414 to 0.8368 once the three newly identified features were added, among which the burst point (bst) was the most effective in predicting news queries. In addition, query expression (qes) was more useful than query terms, and among the click behavior-based features, news URL was the most effective.

Research limitations: Analyses based only on features extracted from query logs might produce limited results. The segmentation tool used in this study has been more widely applied to long texts than to short queries.

Practical implications: The research will help general-purpose search engines address search intents for news events.

Originality/value: Our approach provides a new perspective on recognizing queries with news intent without relying on large news corpora such as blogs or Twitter.
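The abstract names the burst point (bst) as the most effective feature but does not say how it is computed. The sketch below is only one plausible reading, not the authors' method: it assumes a query "bursts" when its count on some day exceeds the mean daily count by a multiple of the standard deviation. The function names `daily_counts` and `has_burst`, the `(day, query)` log format, and the `threshold` parameter are all hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev


def daily_counts(log, query):
    """Count occurrences of `query` per day.

    `log` is an iterable of (day, query_text) pairs; this layout is an
    assumption, not the format of the Sogou.com logs used in the paper.
    """
    return Counter(day for day, q in log if q == query)


def has_burst(counts, days, threshold=2.0):
    """Return True if any day's count exceeds mean + threshold * std.

    Assumed burst definition for illustration only. Days missing from
    `counts` are treated as zero occurrences.
    """
    series = [counts.get(d, 0) for d in days]
    mu = mean(series)
    sigma = pstdev(series)
    if sigma == 0:
        # A perfectly flat series has no burst by this definition.
        return False
    return any(c > mu + threshold * sigma for c in series)


if __name__ == "__main__":
    days = list(range(1, 8))
    # Hypothetical log: "earthquake" spikes on day 4, "weather" stays flat.
    log = [(d, "earthquake") for d in days] + [(4, "earthquake")] * 19
    log += [(d, "weather") for d in days for _ in range(3)]
    print(has_burst(daily_counts(log, "earthquake"), days))
    print(has_burst(daily_counts(log, "weather"), days))
```

A boolean like this could be passed to a classifier alongside content and click-based features. A real system would likely need a longer time window and a threshold tuned on labeled data.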