Efficient Aggregation Algorithms on Very Large Compressed Data Warehouses

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:damitanqq
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Multidimensional aggregation is a dominant operation on data warehouses for on-line analytical processing (CLAP). Many efficient algorithms to compute multidimensional aggregation on relational database based data warehouses have been developed. However, to our knowledge, there is nothing to date in the literature about aggregation algorithms on multidimensional data warehouses that store datasets in multidimensional arrays rather than in tables. This paper presents a set of multidimensional aggregation algorithms on very large and compressed multidimensional data warehouses. These algorithms operate directly on compressed datasets in multidimensional data warehouses without the need to first decompress them. They are applicable to a variety of data compression methods. The algorithms have different performance behavior as a function of dataset parameters, sizes of outputs and main memory availability. The algorithms are described and analyzed with respect to the I/O and CPU costs. A decision procedure to select the most efficient algorithm, given an aggregation request, is also proposed. The analytical and experimental results show that the algorithms are more efficient than the traditional aggregation algorithms.
其他文献
目的对快速康复外科理念对围术期剖宫产孕妇深静脉血栓发生的影响做探讨。方法此次研究中将2014年8月—2017年5月作为对象抽取时间段,于期间随机抽取择期行剖宫产术的孕妇200