Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Informatics, School of >
Informatics Publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/4409

This item has been viewed 17 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
FenschC_Proximity coherence for.pdf856.46 kBAdobe PDFView/Open
Title: Proximity coherence for chip multiprocessors
Authors: Fensch, C.
Barrow-Williams, N.
Moore, S.
Issue Date: 26-Nov-2010
Journal Title: The Nineteenth International Conference on Parallel Architectures and Compilation Techniques (PACT)
Abstract: Many-core architectures provide an efficient way of harnessing the increasing numbers of transistors available in modern fabrication processes. While they are similar to multi-node systems, they exhibit different communication latency and storage characteristics, providing new design opportunities that were previously not feasible. Traditional cache coherence protocols, although often used in many-core designs, have been developed in the context of multinode systems. As such, they seldom take advantage of the new possibilities that many-core architectures offer. We propose Proximity Coherence, a scheme in which L1 load misses are optimistically forwarded to nearby caches via new dedicated links rather than always being indirected via a directory structure. Such an optimization is made possible by the comparable cost of local cache accesses with the use of on-chip network resources. Coherency is maintained using lightweight graph structures embedded in the L1 caches. We compare our Proximity Coherence protocol to an existing directory-based MESI protocol using fullsystem simulations of a 32 core system. Our extension lowers the latency of L1 cache load misses by up to 32% while reducing the bytes transferred on the global on-chip interconnect by up to 19% for a range of parallel benchmarks. Employing Proximity Coherence provides execution time improvements of up to 13%, reduces cache hierarchy energy consumption by up to 30% and delivers a more efficient solution to the challenge of coherence in chip multiprocessors.
Keywords: Proximity Coherence
CMP
cache design
network-on-chip
URI: http://portal.acm.org/citation.cfm?doid=1854273.1854293
http://hdl.handle.net/1842/4409
ISBN: 978-1-4503-0178-7
Appears in Collections:Informatics Publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback