Cudalaunch visual profiler

11/4/2022

CUDALAUNCH VISUAL PROFILER FULL
CUDALAUNCH VISUAL PROFILER CODE

PIConGPU is a highly optimized application that runs production jobs at scale on a system Oak more » Ridge Leadership Facility’s (OLCF) Summit supercomputer (using the full machine at 4600 nodes at 98% of GPU utilization on all ~28000 NVIDIA Volta GPUs).

While PIConGPU has been optimized for at least 5 years to run well on NVIDIA GPU-based clusters, there has been limited exploration by the development team of potential scalability bottlenecks using recently updated and new tools including NVIDIA’s NVProf tool and the brand-new NVIDIA NSight Suite (Systems and Compute) tools. PIConGPU, Particle In Cell on GPUs, is an open source simulations framework for plasma and laser-plasma physics used to develop advanced particle accelerators for radiation therapy of cancer, high energy physics and photon science. This allows the user to see the progress of the behavior of the applications during their lifetime. Furthermore, our tool provides these statistics during more » the entire runtime of the tool as a time series and not just aggregate statistics at the end of the application run. Most of the performance counter results are the same in both vendor tools and our tool when using LDMS to collect these results. Also, we discuss our current validation results. In this report, we discuss the current limitations in the NVIDIA monitoring tools, how we overcame such limitations, and present an overview of the tool we built to monitor GPU performance in LDMS and its capabilities. Therefore, we chose to develop a GPU monitoring capability within the same framework. Sandia has developed CPU application monitoring capability within LDMS. The Light-Weight Distributed Metric System (LDMS) at Sandia is an infrastructure widely adopted for large-scale systems and application monitoring. Since NVIDIA GPUs are currently the most commonly implemented in HPC applications and systems, NVIDIA tools are the solution for performance monitoring.

CUDALAUNCH VISUAL PROFILER CODE

To deliver the best performance on a GPU, we need to create monitoring tools to ensure that we optimize the code to get the most performance and efficiency out of a GPU. They are viewed by many as a technology facilitator for the surge in fields like machine learning and Convolutional Neural Networks.

GPUs are now a fundamental accelerator for many high-performance computing applications.

0 Comments

Cudalaunch visual profiler

CUDALAUNCH VISUAL PROFILER CODE

Leave a Reply.

Author

Archives

Categories