DEAM: Decoupled, Expressive, Area-Efficient Metadata Cache

Source: Journal of Computer Science & Technology
Chip multiprocessors present new opportunities for holistic on-chip data and coherence management. An intelligent protocol should adapt to fine-grained access behavior. In terms of metadata storage, the size of a conventional directory grows as the square of the number of processors, making it very expensive in large-scale systems. In this paper, we propose a metadata cache framework to achieve three goals: 1) reducing the latency of data access and coherence activities, 2) saving metadata storage, and 3) providing support for other optimization techniques. The metadata is implemented with compact structures and tracks the dynamically changing access pattern. The pattern information is used to guide the delegation and replication of decoupled data and metadata to allow fast access. We also use our metadata cache as a building block to enhance stream prefetching. Using detailed execution-driven simulation, we demonstrate that our protocol achieves an average speedup of 1.12x over a shared cache protocol while using 1/5 of the metadata storage.
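The quadratic growth of a conventional directory mentioned above can be illustrated with a small calculation. This is a minimal sketch of a generic full-map (bit-vector) directory, not of the DEAM structures, which the abstract does not detail. The function name and the parameters `lines_per_core` and `state_bits` are assumptions chosen for illustration.

```python
def full_map_directory_bits(num_cores: int, lines_per_core: int, state_bits: int = 0) -> int:
    """Total storage, in bits, of a hypothetical full-map directory.

    Assumptions (not taken from the paper):
    - every core contributes `lines_per_core` tracked cache lines, so the
      number of directory entries grows linearly with the core count;
    - each entry holds one presence bit per core plus `state_bits`.
    """
    entries = num_cores * lines_per_core      # grows linearly with cores
    bits_per_entry = num_cores + state_bits   # presence vector also grows with cores
    return entries * bits_per_entry           # product grows quadratically


if __name__ == "__main__":
    for cores in (16, 32, 64):
        bits = full_map_directory_bits(cores, lines_per_core=1024)
        print(cores, "cores:", bits // 8, "bytes")
```

Under these assumptions, doubling the core count quadruples the presence-bit storage, since both the number of entries and the width of each entry double.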