SC16 Tutorial: Application porting and optimization on GPU-accelerated POWER architectures
The POWER processor has re-emerged as a technology for supercomputer architectures. One major reason is the tight integration of processor and GPU accelerator through the new NVLink technology. Two major sites in the US, ORNL and LLNL, have already decided to base their pre-exascale systems on this new architecture. This tutorial offers an opportunity to gain in-depth knowledge of, and first experience with, GPU-accelerated POWER nodes. It focuses on porting applications to a single node and covers architecture, compilers, performance analysis, and multi-GPU programming. The tutorial includes an overview of the new NVLink-based node architectures and lectures on first-hand experience in porting to this architecture, and concludes with tool-based exercises that focus on performance.
- Christoph Hagleitner (IBM Research)
- Oscar Hernandez (ORNL)
- Andreas Herten (Jülich Supercomputing Centre)
- Jiri Kraus (NVIDIA)
- Bronson Messer (ORNL)
- Archana Ravindar (IBM)