A Review of Supercomputer Performance Monitoring Systems
Citations Over Time
Abstract
High Performance Computing is now one of the emerging fields in computer science and its applications. Top HPC facilities, supercomputers, offer great opportunities in modeling diverse processes thus allowing to create more and greater products without full-scale experiments. Current supercomputers and applications for them are very complex and thus are hard to use efficiently. Performance monitoring systems are the tools that help to understand the efficiency of supercomputing applications and overall supercomputer functioning. These systems collect data on what happens on a supercomputer (performance data, performance metrics) and present them in a way allowing to make conclusions about performance issues in programs running on the supercomputer. In this paper we give an overview of existing performance monitoring systems designed for or used on supercomputers. We give a comparison of performance monitoring systems found in literature, describe problems emerging in monitoring large scale HPC systems, and outline our vision on future direction of HPC monitoring systems development.
Related Papers
- → A Review of Supercomputer Performance Monitoring Systems(2021)6 cited
- → The practice of conducting performance analysis of supercomputer applications(2019)1 cited
- → The Next-generation Supercomputer and Visuakization(2006)
- Multi-level Structure Abstract and Description of Supercomputer(2008)
- Susquehanna Chorale Spring Concert "Roots and Wings"(2017)