Optimizing memory locality using a locality-aware page table

Eduardo H.M. Cruz, Matthias Diener, Marco A.Z. Alves, Laércio L. Pilla, Philippe O.A. Navaux

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

One of the main challenges for modern parallel shared-memory architectures are accesses to main memory. In current systems, the performance and energy efficiency of memory accesses depend on their locality: accesses to remote caches and NUMA nodes are more expensive than accesses to local ones. Increasing the locality requires knowledge about how the threads of a parallel application access memory pages. With this information, pages can be migrated to the NUMA nodes that access them (data mapping), as well as threads that access the same pages can be migrated to the same node such that locality can be improved even further (thread mapping). In this paper, we propose LAPT, a mechanism to store the memory access pattern of parallel applications in the page table, which is updated by the hardware during TLB misses. This information is used by the operating system to perform an optimized thread and data mapping during the execution of the parallel application. In contrast to previous work, LAPT does not require any previous information about the behavior of the applications, or changes to the application or runtime libraries. Extensive experiments with the NAS Parallel Benchmarks (NPB) and PARSEC showed performance and energy efficiency improvements of up to 19.2% and 15.7%, respectively, (6.7% and 5.3% on average).

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 26th International Symposium
PublisherIEEE Computer Society
Pages198-205
Number of pages8
ISBN (Electronic)9781479969043
DOIs
StatePublished - Dec 1 2014
Externally publishedYes
Event26th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2014 - Paris, France
Duration: Oct 22 2014Oct 24 2014

Publication series

NameProceedings - Symposium on Computer Architecture and High Performance Computing
ISSN (Print)1550-6533

Other

Other26th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2014
Country/TerritoryFrance
CityParis
Period10/22/1410/24/14

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'Optimizing memory locality using a locality-aware page table'. Together they form a unique fingerprint.

Cite this