Interprocedural Compiler Analysis for Reducing Memory Latency
Loading...
View/Download File
Persistent link to this item
Statistics
View StatisticsJournal Title
Journal ISSN
Volume Title
Title
Interprocedural Compiler Analysis for Reducing Memory Latency
Alternative title
Authors
Published Date
1997
Publisher
Type
Report
Abstract
The growing discrepancy between the processor and the memory speed forces us to find a
way to improve the performance of memory systems. In a current typical memory system, a
cache miss incurs heavy penalty in terms of latency. With shared-memory multiprocessors, the
problem of high memory latency is more pronounced than uniprocessor systems because most
or all of the main memory system is remote to the processor, whose accesses are significantly
slower than accesses to the cache and the local memory. This thesis makes three contributions
towards the reduction of remote memory accesses. First, we present an interprocedural compiler
analysis to statically align the scheduling of the parallel tasks and to allocate data so as to
reduce the number of remote memory references. Second, most compiler tedmiques for data
locality enhancement benefit more from static task scheduling methods, but static scheduling
potentially results in unbalanced workload among the processors. This thesis proposes a compiler
algorithm to identify parallel loops with relatively balanced operation counts in their iterations.
Furthermore, we describe an experiment to compare the workload balance under different task
scheduling methods. Third, this thesis examines the interaction between different. levels of memory
mapping on Cache Coherent, Non Uniform Memory Access (CC-NUMA) multiprocessors and
compares several mapping alternatives. Event-driven simulations a.re performed to examine the
effectiveness of our compiler techniques and to compare different mapping alternatives. The
simulation results show that data allocation schemes often make a great impact on the execution
time even with sizable private caches and that our compiler schemes perform consistently well.
Keywords
Description
Related to
Replaces
License
Series/Report Number
Technical Report; 97-001
Funding information
Isbn identifier
Doi identifier
Previously Published Citation
Other identifiers
Suggested citation
Nguyen, Trung Ngoc. (1997). Interprocedural Compiler Analysis for Reducing Memory Latency. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/215304.
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.