HPCTOOLKIT: tools for performance analysis of optimized parallel programs. HPCTOOLKIT is an integrated suite of tools that supports measurement, analysis, attribution, and presentation of application performance for both sequential and parallel programs. HPCTOOLKIT can pinpoint and quantify scalability bottlenecks in fully optimized parallel programs with a measurement overhead of only a few percent. Recently, new capabilities were added to HPCTOOLKIT for collecting call path profiles for fully optimized codes without any compiler support, pinpointing and quantifying bottlenecks in multithreaded programs, exploring performance information and source code using a new user interface, and displaying hierarchical space–time diagrams based on traces of asynchronous call path samples. This paper provides an overview of HPCTOOLKIT and illustrates its utility for performance analysis of parallel applications. Copyright © 2009 John Wiley & Sons, Ltd

References in zbMATH (referenced in 10 articles )

Showing results 1 to 10 of 10.
Sorted by year (citations)

  1. Nikitenko, D. A.; Wolf, F.; Mohr, B.; Hoefler, T.; Stefanov, K. S.; Voevodin, Vad. V.; Antonov, A. S.; Calotoiu, A.: Influence of noisy environments on behavior of HPC applications (2021)
  2. Benavides, Zachary; Vora, Keval; Gupta, Rajiv; Zhang, Xiangyu: Annotation guided collection of context-sensitive parallel execution profiles (2019)
  3. Dong, Zheng; Liu, Cong: Analysis techniques for supporting hard real-time sporadic gang task systems (2019)
  4. Arleo, Alessio; Didimo, Walter; Liotta, Giuseppe; Montecchiani, Fabrizio: GiVip: a visual profiler for distributed graph processing systems (2018)
  5. Berzins, Martin; Beckvermit, Jacqueline; Harman, Todd; Bezdjian, Andrew; Humphrey, Alan; Meng, Qingyu; Schmidt, John; Wight, Charles: Extending the Uintah framework through the petascale modeling of detonation in arrays of high explosive devices (2016)
  6. Ding, Chen; Xiang, Xiaoya; Bao, Bin; Luo, Hao; Luo, Ying-Wei; Wang, Xiao-Lin: Performance metrics and models for shared cache (2014) ioport
  7. -: Pinned OS/Services: a case study of XML parsing on Intel SCC (2013) ioport
  8. Liu, Xu; Zhan, Jianfeng; Zhan, Kunlin; Shi, Weisong; Yuan, Lin; Meng, Dan; Wang, Lei: Automatic performance debugging of SPMD-style parallel programs (2011) ioport
  9. Adhianto, L.; Banerjee, S.; Fagan, M.; Krentel, M.; Marin, G.; Mellor-Crummey, J.; Tallent, N. R.: HPCTOOLKIT: tools for performance analysis of optimized parallel programs (2010) ioport
  10. Hölldobler, Steffen; Manthey, Norbert; Saptawijaya, Ari: Improving resource-unaware SAT solvers (2010)

Further publications can be found at: http://hpctoolkit.org/publications.html