论文部分内容阅读
WebView是指存储在WebRepository中的Web页面。WebView对于很多系统来说都非常有用,它可以给用户的查询和分析带来更快的效率,特别适合联机分析处理(OLAP)和决策支持。然而当使用Repository中的信息为用户服务时,笔者无法保证所提供的信息是最新的(与源数据保持up-to-date)。在这种情况下,虽然把这些信息返回给用户,实际上却不知道这些信息是否可以满足用户的需要。为了提高数据质量,系统需要尽可能提高数据时新性(Freshness),保持Repository与数据源相一致。该文围绕数据时新性,对系统存储哪些页面,这些页面又如何更新和维护才能取得系统能力和效率之间的平衡进行讨论,并提出一种基于效益的时新性保持方法(Profit-basedFreshness-keepingMethod,PFM),同时给出了它的近似解。实验结果说明,该方法在系统效率和数据时新性综合评估方面优于传统方法,同时对Web环境具有良好的适应性。
WebView refers to the Web pages stored in WebRepository. WebView is very useful for many systems, it can give users faster query and analysis efficiency, especially for online analytical processing (OLAP) and decision support. However, I can not guarantee that the information provided is up-to-date (up-to-date with the source data) when using the information in the Repository to serve my users. In this case, although the information returned to the user, in fact, do not know whether these information can meet the needs of users. To improve data quality, the system needs to maximize data freshness and keep Repository consistent with data sources. This paper focuses on the newness of data, and discusses how the pages are stored in the system and how the pages can be updated and maintained in order to achieve a balance between system capability and efficiency. Then, a Profit-based Freshness Method based on Benefit -keepingMethod, PFM) and gives its approximate solution. The experimental results show that this method is superior to the traditional method in the aspects of system efficiency and new comprehensive evaluation of data, and has good adaptability to Web environment.