TAU: The TAU parallel performance system. The ability of performance technology to keep pace with the growing complexity of parallel and distributed systems depends on robust performance frameworks that can at once provide system-specific performance capabilities and support high-level performance problem solving. Flexibility and portability in empirical methods and processes are influenced primarily by the strategies available for instrmentation and measurement, and how effectively they are integrated and composed. This paper presents the TAU (Tuning and Analysis Utilities) parallel performance sytem and describe how it addresses diverse requirements for performance observation and analysis.

References in zbMATH (referenced in 16 articles )

Showing results 1 to 16 of 16.
Sorted by year (citations)

  1. de la Encina, Alberto; Rodríguez, Ismael; Rubio, Fernando: pHood: tool description, analysis techniques, and case studies (2014)
  2. Ding, Chen; Xiang, Xiaoya; Bao, Bin; Luo, Hao; Luo, Ying-Wei; Wang, Xiao-Lin: Performance metrics and models for shared cache (2014)
  3. Chandrasekaran, Sunita; Shanbagh, Shilpa; Jayaraman, Ramkumar; Maskell, Douglas L.; Cheah, Hui Yan: C2FPGA -- a dependency-timing graph design methodology (2013)
  4. De Cecchis, D.; Drummond, L.A.; Castillo, J.E.: Design of a distributed coupling toolkit for high performance computing environment (2013)
  5. Amritkar, Amit; Tafti, Danesh; Liu, Rui; Kufrin, Rick; Chapman, Barbara: OpenMP parallelism for fluid and fluid-particulate systems (2012)
  6. Camata, J.J.; Rossa, A.L.; Valli, Andrea M.P.; Catabriga, Lucia; Carey, Graham F.; Coutinho, Alvaro L.G.A.: Reordering and incomplete preconditioning in serial and parallel adaptive mesh refinement and coarsening flow solutions (2012)
  7. Mohror, Kathryn; Karavanic, Karen L.: Trace profiling: scalable event tracing on high-end parallel systems (2012)
  8. Rutar, Nick; Hollingsworth, Jeffrey K.: Data centric techniques for mapping performance data to program variables (2012)
  9. Vömel, Christof; Tomov, Stanimire; Dongarra, Jack: Divide and conquer on hybrid GPU-accelerated multicore systems (2012)
  10. Crivellini, A.; Bassi, F.: An implicit matrix-free discontinuous Galerkin solver for viscous and turbulent aerodynamic simulations (2011)
  11. Elias, Renato N.; Camata, Jose J.; Aveleda, Albino; Coutinho, Alvaro L.G.A.: Evaluation of message passing communication patterns in finite element solution of coupled problems (2011)
  12. Liu, Xu; Zhan, Jianfeng; Zhan, Kunlin; Shi, Weisong; Yuan, Lin; Meng, Dan; Wang, Lei: Automatic performance debugging of SPMD-style parallel programs (2011)
  13. Geimer, Markus; Kuhlmann, Björn; Pulatova, Farzona; Wolf, Felix; Wylie, Brian J.N.: Scalable collation and presentation of call-path profile data with CUBE (2008)
  14. Morris, Alan; Malony, Allen D.; Shende, Sameer S.: Supporting nested OpenMP parallelism in the TAU performance system (2008)
  15. Terrel, A.R.; Scott, L.R.; Knepley, M.G.; Kirby, R.C.: Automated FEM discretizations for the Stokes equation (2008)
  16. Morris, Alan; Malony, Allen D.; Shende, Sameer S.: Supporting nested OpenMP parallelism in the TAU performance system (2007)