TY - CHAP
T1 - Conclusions
AU - H. M. Cruz, Eduardo
AU - Diener, Matthias
AU - O. A. Navaux, Philippe
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer International Publishing AG, part of Springer Nature 2018.
PY - 2018
Y1 - 2018
N2 - Locality of memory accesses is one of the most important aspects to be considered when designing an architecture or developing software. With the introduction of multicore architectures, the memory hierarchy had to evolve to able to provide the necessary bandwidth to several cores operating in parallel. With this evolution, memory hierarchies started to present several caches in the same level, some levels shared by multiple cores, and other private to a core. Another important step was the incorporation of a memory controller inside the processor, in which multiprocessor systems presented NUMA characteristics. Due to the introduction of such technologies, the performance of memory hierarchies and the systems as a whole were even more dependent on memory locality. In this context, techniques such as sharing-aware thread and data mapping are able to increase memory locality and thereby performance. Our experiments indicate performance improvements of up to 200% in a scientific application.
AB - Locality of memory accesses is one of the most important aspects to be considered when designing an architecture or developing software. With the introduction of multicore architectures, the memory hierarchy had to evolve to able to provide the necessary bandwidth to several cores operating in parallel. With this evolution, memory hierarchies started to present several caches in the same level, some levels shared by multiple cores, and other private to a core. Another important step was the incorporation of a memory controller inside the processor, in which multiprocessor systems presented NUMA characteristics. Due to the introduction of such technologies, the performance of memory hierarchies and the systems as a whole were even more dependent on memory locality. In this context, techniques such as sharing-aware thread and data mapping are able to increase memory locality and thereby performance. Our experiments indicate performance improvements of up to 200% in a scientific application.
UR - http://www.scopus.com/inward/record.url?scp=85049797477&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85049797477&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-91074-1_5
DO - 10.1007/978-3-319-91074-1_5
M3 - Chapter
AN - SCOPUS:85049797477
T3 - SpringerBriefs in Computer Science
SP - 49
BT - SpringerBriefs in Computer Science
PB - Springer Nature
ER -