Computation on Sentence Semantic Distance for Novelty Detection

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:zhuyi9021
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Novelty detection is to retrieve new information and filter redundancy from given sentences that are relevant to a specific topic. In TREC2003, the authors tried an approach to novelty detection with semantic distance computation.The motivation is to expand a sentence by introducing semantic information. Computation on semantic distance between sentences incorporates WordNet with statistical information. The novelty detection is treated as a binary classification problem: new sentence or not. The feature vector, used in the vector space model for classification, consists of various factors, including the semantic distance from the sentence to the topic and the distance from the sentence to the previous relevant context occurring before it. New sentences are then detected with Winnow and support vector machine classifiers,respectively. Several experiments are conducted to survey the relationship between different factors and performance. It is proved that semantic computation is promising in novelty detection. The ratio of new sentence size to relevant size is further studied given different relevant document sizes. It is found that the ratio reduced with a certain speed (about 0.86).Then another group of experiments is performed supervised with the ratio. It is demonstrated that the ratio is helpful to improve the novelty detection performance.
其他文献
Conflicts between two or more parties arise for various reasons and perspectives. Thus, resolution of conflicts frequently relies on some form of negotiation. T
最近,一种名为“洗血”的医美项目被炒得火热:从身体中抽取一定量的血液,向血液中注入臭氧,之后再输回体内,就可以“净化”血液。还有说法称,这种方法可美容养颜,是驻颜逆龄的神奇之术。那么,洗血真的有这些功效吗?  中国医学科学院整形外科医院专家表示,目前并没有科学证据证明通过这样的一个所谓的血液净化的方法,就能美容养颜、延年益寿。在医学上最有效的血液疗法就是透析,在得了尿毒症的情况下,通过透析的方法,
期刊
期刊
本栏目内容选自《人民日报》《健康报》《北京晚报》《北京科技报》、蝌蚪五线谱、“谣言过滤器”微信公众号、“科普北京”微信公众号等。第一时间按下所有楼层键,电梯坠地
期刊
期刊