A practical automatic polyhedral parallelizer and locality optimizer
Citations Over TimeTop 1% of 2008 papers
Abstract
We present the design and implementation of an automatic polyhedral source-to-source transformation framework that can optimize regular programs (sequences of possibly imperfectly nested loops) for parallelism and locality simultaneously. Through this work, we show the practicality of analytical model-driven automatic transformation in the polyhedral model -- far beyond what is possible by current production compilers. Unlike previous works, our approach is an end-to-end fully automatic one driven by an integer linear optimization framework that takes an explicit view of finding good ways of tiling for parallelism and locality using affine transformations. The framework has been implemented into a tool to automatically generate OpenMP parallel code from C program sections. Experimental results from the tool show very high speedups for local and parallel execution on multi-cores over state-of-the-art compiler frameworks from the research community as well as the best native production compilers. The system also enables the easy use of powerful empirical/iterative optimization for general arbitrarily nested loop sequences.
Related Papers
- → A practical automatic polyhedral parallelizer and locality optimizer(2008)876 cited
- → A practical automatic polyhedral parallelizer and locality optimizer(2008)236 cited
- PLuTo: A Practical and Fully Automatic Polyhedral Program Optimization System(2015)
- → Efficient nested loop pipelining in high level synthesis using polyhedral bubble insertion(2011)17 cited
- → Nested-Loops Tiling for Parallelization and Locality Optimization(2017)2 cited