An evaluation of current SIMD programming models for C++
2016pp. 1–8
Citations Over TimeTop 10% of 2016 papers
Abstract
SIMD extensions were added to microprocessors in the mid '90s to speed-up data-parallel code by vectorization. Unfortunately, the SIMD programming model has barely evolved and the most efficient utilization is still obtained with elaborate intrinsics coding. As a consequence, several approaches to write efficient and portable SIMD code have been proposed. In this work, we evaluate current programming models for the C++ language, which claim to simplify SIMD programming while maintaining high performance.
Related Papers
- → Auto-vectorization of interleaved data for SIMD(2006)123 cited
- → Multi-platform Auto-vectorization(2006)97 cited
- → Insufficient Vectorization: A New Method to Exploit Superword Level Parallelism(2016)1 cited
- → Performance and efficiency investigations of SIMD programs of Coulomb solvers on multi-and many-core systems with vector units(2020)1 cited
- → Micro-Kernels for Portable and Efficient Matrix Multiplication in Deep Learning(2022)3 cited