Article thumbnail

Scalable Tools for Non-Intrusive Performance Debugging of Parallel Linux Workloads

By Robert Schöne, Joseph Schuchart, Thomas Ilsche and Daniel Hackenberg


There is a variety of tools to measure the performance of Linux systems and the applications running on them. However, the resulting performance data is often presented in plain text format or only with a very basic user interface. For large systems with many cores and concurrent threads, it is increasingly difficult to present the data in a clear way for analysis. Moreover, certain performance analysis and debugging tasks require the use of a high-resolution time-line based approach, again entailing data visualization challenges. Tools in the area of High Performance Computing (HPC) have long been able to scale to hundreds or thousands of parallel threads and help finding performance anomalies. We therefore present a solution to gather performance data using Linux performance monitoring interfaces. A combination of sampling and careful instrumentation allows us to obtain detailed performance traces with manageable overhead. We then convert the resulting output to the Open Trace Format (OTF) to bridge the gap between the recording infrastructure and HPC analysis tools. We explore ways to visualize the data by using the graphical tool Vampir. The combination of established Linux and HPC tools allows us to create an interface for easy navigation through time-ordered performance data grouped by thread or CPU and to help users find opportunities for performance optimizations

Topics: info:eu-repo/classification/ddc/004, ddc:004, LINUX ; Leistungsbewertung ; Instrumentation ; Abtastung ; Ablaufverfolgung, Performane Analyse, Linux, perf, Instrumentierung, Sampling, Tracing, performance analysis, linux, perf, instrumentation, sampling, tracing
Publisher: Ottawa Linux Symposium Comittee
Year: 2014
OAI identifier: oai:qucosa:de:qucosa:28410
Provided by: Qucosa
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • (external link)
  • (external link)
  • (external link)
  • (external link)
  • Suggested articles

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.