Locality-aware Cache Hierarchy Management for Multicore Processors

Locality-aware Cache Hierarchy Management for Multicore Processors
Author :
Publisher :
Total Pages : 194
Release :
ISBN-10 : OCLC:912300074
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Locality-aware Cache Hierarchy Management for Multicore Processors by :

Download or read book Locality-aware Cache Hierarchy Management for Multicore Processors written by and published by . This book was released on 2015 with total page 194 pages. Available in PDF, EPUB and Kindle. Book excerpt: Next generation multicore processors and applications will operate on massive data with significant sharing. A major challenge in their implementation is the storage requirement for tracking the sharers of data. The bit overhead for such storage scales quadratically with the number of cores in conventional directory-based cache coherence protocols. Another major challenge is limited cache capacity and the data movement incurred by conventional cache hierarchy organizations when dealing with massive data scales. These two factors impact memory access latency and energy consumption adversely. This thesis proposes scalable efficient mechanisms that improve effective cache capacity (i.e., by improving utilization) and reduce data movement by exploiting locality and controlling replication. First, a limited directory-based protocol, ACKwise is proposed to track the sharers of data in a cost-effective manner. ACKwise leverages broadcasts to implement scalable cache coherence. Broadcast support can be implemented in a 2-D mesh network by making simple changes to its routing policy without requiring any additional virtual channels. Second, a locality-aware replication scheme that better manages the private caches is proposed. This scheme controls replication based on data reuse information and seamlessly adapts between private and logically shared caching of on-chip data at the fine granularity of cache lines. A low-overhead runtime profiling capability to measure the locality of each cache line is built into hardware. Private caching is only allowed for data blocks with high spatio-temporal locality. Third, a Timestamp-based memory ordering validation scheme is proposed that enables the locality-aware private cache replication scheme to be implementable in processors with out-of-order memory that employ popular memory consistency models. This method does not rely on cache coherence messages to detect speculation violations, and hence is applicable to the locality-aware protocol. The timestamp mechanism is efficient due to the observation that consistency violations only occur due to conflicting accesses that have temporal proximity (i.e., within a few cycles of each other), thus requiring timestamps to be stored only for a small time window. Fourth, a locality-aware last-level cache (LLC) replication scheme that better manages the LLC is proposed. This scheme adapts replication at runtime based on fine-grained cache line reuse information and thereby, balances data locality and off-chip miss rate for optimized execution. Finally, all the above schemes are combined to obtain a cache hierarchy replication scheme that provides optimal data locality and miss rates at all levels of the cache hierarchy. The design of this scheme is motivated by the experimental observation that both locality-aware private cache & LLC replication enable varying performance improvements across benchmarks. These techniques enable optimal use of the on-chip cache capacity, and provide low-latency, low-energy memory access, while retaining the convenience of shared memory and preserving the same memory consistency model. On a 64-core multicore processor with out-of-order cores, Locality-aware Cache Hierarchy Replication improves completion time by 15% and energy by 22% over a state-of-the-art baseline while incurring a storage overhead of 30.7 KB per core. (i.e., 10% the aggregate cache capacity of each core).

Locality-aware Cache Hierarchy Management for Multicore Processors Related Books

Locality-aware Cache Hierarchy Management for Multicore Processors
Language: en
Pages: 194
Authors:
Categories:
Type: BOOK - Published: 2015 - Publisher:

GET EBOOK

Next generation multicore processors and applications will operate on massive data with significant sharing. A major challenge in their implementation is the st
Locality-aware Task Management on Many-core Processors
Language: en
Pages:
Authors: Richard Myungon Yoo
Categories:
Type: BOOK - Published: 2012 - Publisher:

GET EBOOK

The landscape of computing is changing. Due to limits in transistor scaling, the traditional approach to exploit instruction-level parallelism through wide-issu
Multi-Core Cache Hierarchies
Language: en
Pages: 155
Authors: Rajeev Balasubramonian
Categories: Technology & Engineering
Type: BOOK - Published: 2011-06-06 - Publisher: Morgan & Claypool Publishers

GET EBOOK

A key determinant of overall system performance and power dissipation is the cache hierarchy since access to off-chip memory consumes many more cycles and energ
Handbook of Research on High Performance and Cloud Computing in Scientific Research and Education
Language: en
Pages: 476
Authors: Despotovi?-Zraki?, Marijana
Categories: Computers
Type: BOOK - Published: 2014-03-31 - Publisher: IGI Global

GET EBOOK

As information systems used for research and educational purposes have become more complex, there has been an increase in the need for new computing architectur
A Higher Order Theory of Locality and Its Application in Multicore Cache Management
Language: en
Pages: 186
Authors: Xiaoya Xiang
Categories:
Type: BOOK - Published: 2014 - Publisher:

GET EBOOK

"As multi-core processors become commonplace and cloud computing is gaining acceptance, applications are increasingly run in parallel over a shared memory hiera