Attribute Reduction with Test Cost Constraint

Source: Journal of Electronic Science and Technology
In many machine learning applications, data are not free, and there is a test cost for each data item. For economic reasons, some existing works try to minimize the test cost while preserving a particular property of a given decision system. In this paper, we point out that in some applications the test cost one can afford is limited. Hence, one has to sacrifice some of these properties to keep the test cost within a budget. To formalize this issue, we define the test cost constraint attribute reduction problem, where the optimization objective is to minimize the conditional information entropy. This problem generalizes both the test-cost-sensitive attribute reduction problem and the 0-1 knapsack problem, and is therefore more challenging. We propose a heuristic algorithm based on information gain and test costs to deal with the new problem. The algorithm is tested on four UCI (University of California-Irvine) datasets with various test cost settings. Experimental results indicate appropriate settings for λ, the only user-specified parameter.
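The abstract does not give the heuristic itself, so the following is only a minimal sketch of one way such a method could look, not the authors' algorithm. It assumes a greedy strategy that repeatedly adds the affordable attribute with the largest information gain weighted by `cost ** lam`, where `lam` stands in for λ and is assumed to be non-positive so that cheaper tests are favored. The function names, the weighting form, and the toy decision table are all hypothetical.

```python
from collections import Counter, defaultdict
from math import log2


def conditional_entropy(rows, attrs, decision):
    """Return H(decision | attrs) for a decision table given as a list of dicts."""
    groups = defaultdict(Counter)
    for row in rows:
        key = tuple(row[a] for a in attrs)
        groups[key][row[decision]] += 1
    n = len(rows)
    h = 0.0
    for counts in groups.values():
        size = sum(counts.values())
        for c in counts.values():
            h -= (size / n) * (c / size) * log2(c / size)
    return h


def greedy_reduct(rows, costs, decision, budget, lam=-1.0):
    """Greedily pick attributes under a total test cost budget.

    Assumed scoring rule (not taken from the paper):
        score(a) = information_gain(a) * cost(a) ** lam
    Test costs must be positive. Returns (selected, spent, entropy).
    """
    selected, spent = [], 0
    current = conditional_entropy(rows, selected, decision)
    while True:
        best, best_score, best_h = None, 0.0, current
        for attr, cost in costs.items():
            # Skip attributes already chosen or no longer affordable.
            if attr in selected or spent + cost > budget:
                continue
            h = conditional_entropy(rows, selected + [attr], decision)
            score = (current - h) * cost ** lam
            if score > best_score:
                best, best_score, best_h = attr, score, h
        if best is None:  # nothing affordable reduces the entropy further
            break
        selected.append(best)
        spent += costs[best]
        current = best_h
    return selected, spent, current


# Hypothetical toy decision table: "a" determines "d" exactly but is expensive.
ROWS = [
    {"a": 0, "b": 0, "c": 0, "d": 0},
    {"a": 0, "b": 0, "c": 1, "d": 0},
    {"a": 1, "b": 0, "c": 0, "d": 1},
    {"a": 1, "b": 1, "c": 1, "d": 1},
]
COSTS = {"a": 5, "b": 1, "c": 2}

if __name__ == "__main__":
    print(greedy_reduct(ROWS, COSTS, "d", budget=3))
    print(greedy_reduct(ROWS, COSTS, "d", budget=5, lam=0.0))
```

On this toy table, a budget of 3 rules out attribute `a`, so the sketch settles for `b` and `c` and leaves some residual entropy. With a budget of 5 and `lam=0.0` (costs ignored in the score), it selects `a` alone and reaches zero conditional entropy, which illustrates why the choice of λ matters.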