Home > Contribution List

Contribution List

Displaying 12 contributions out of 12
Session: Session 4
on 23 Jun 2016 at 17:45
Session: Session 1
The Hartree Centre has been established as a UK focus for industrial engagement. STFC has acquired a new IBM system based on the OpenPOWER architecture, comprising 32 nodes with POWER8 CPUs and nVIDIA Kepler K80 GPUs. We report early evaluation of the system using some real applications based on the Lattice Boltzmann Method and using FFTs. No optimisation has been carried out yet, but results are ... More
on 23 Jun 2016 at 10:30
Session: Session 2
All-atom molecular dynamics simulations of biomolecules provide a powerful tool for exploring the structure and dynamics of large protein complexes within realistic cellular environments. Unfortunately, such simulations are extremely demanding in terms of their computational requirements, and they present many challenges in terms of preparation, simulation methodology, and analysis and visualizati ... More
on 23 Jun 2016 at 11:30
Session: Session 3
Modern servers provide different features for managing the amount of energy that is needed to execute a given work-load. In this article we focus on a new generation of GPU-accelerated servers with POWER8 processors. For different scientific applications, which have in common that they have been written for massively-parallel computers, we measure energy-to-solution for different system configurat ... More
on 23 Jun 2016 at 14:45
Session: Session 3
In this article we present algorithmic rethinking and code re-engineering that is required in porting highly successful and popular planewave codes to next generation heterogeneous OpenPOWER architectures, that foster acceleration and high bandwidth links to GPUs. In this work we focus on CPMD as the most representative software for ad initio molecular dynamics simulations. We have ported to the G ... More
on 23 Jun 2016 at 15:30
Session: Session 1
This paper discusses the performance of IBM’s Power8 CPUs, on a number of skeleton, financial and CFD benchmarks and applications. Implicitly, the performance of the software toolchain is also tested - the bare-bones Little-Endian Ubuntu, the GNU 5.3 and the XL 14.1.3 compilers and OpenMP runtimes. First, we aim to establish some roofline numbers on bandwidth and compute throughput, then move on ... More
on 23 Jun 2016 at 09:45
Session: Session 3
This paper presents the design and implementation of energy measurement and management features found in OpenPOWER systems. The firmware and its ecosystem are open source to allow the community to extend the capabilities.
on 23 Jun 2016 at 14:00
Session: Session 4
The panel will discuss opportunities for HPC based on OpenPOWER technologies. On the panel will be: * Jeff Vetter (ORNL Future Technologies Group) * Rich Graham (Mellanox) * Piero Altoe (E4) * John Stone (University of Illinois at Urbana-Champaign)
on 23 Jun 2016 at 17:00
Session: Session 2
POWER 8, the latest RISC (Reduced Instruction Set Computer) microprocessor of the IBM Power architecture family, was designed to significantly benefit emerging workloads, including Business Analytics, Cloud Computing and High Performance Computing. In this paper, we provide a thorough performance evaluation on a widely used large-scale graph processing framework, Spark/GraphX, on a POWER 8 cluster ... More
on 23 Jun 2016 at 12:15
Session: Session 4
The IBM POWER8 CPU is a high-performance multi-core hardware which targets the usage with computational intense numerical codes. Combustion modeling is among the most computational demanding mathematical problems. Therefore, in this paper we present a performance analysis of the 3D-combustion modeling software RECOM-AIOLOS on a POWER8 node. The analysis reveals the strengths of the POWER8 hardware ... More
on 23 Jun 2016 at 16:30
Session: Session 1
With the appearance of the heterogeneous platform OpenPOWER, many-core accelerator devices have been coupled with POWER host processors for the first time. Towards utilizing their full potential, it is worth investigating performance portable algorithms that allow to choose the best-fitting hardware for each domain-specific compute task. Suiting even the high level of parallelism on modern GPGPUs, ... More
on 23 Jun 2016 at 09:15
Session: Session 1
on 23 Jun 2016 at 09:00