References in zbMATH (referenced in 17 articles)

Sorted by year

  1. Ghysels, Pieter; Vanroose, Wim: Modeling the performance of geometric multigrid stencils on multicore computer architectures (2015)
  2. Malas, T.; Hager, G.; Ltaief, H.; Stengel, H.; Wellein, G.; Keyes, D.: Multicore-optimized wavefront diamond blocking for optimizing stencil updates (2015)
  3. Röhrig-Zöllner, Melven; Thies, Jonas; Kreutzer, Moritz; Alvermann, Andreas; Pieper, Andreas; Basermann, Achim; Hager, Georg; Wellein, Gerhard; Fehske, Holger: Increasing the performance of the Jacobi-Davidson method by blocking (2015)
  4. Lee, Jaehwan; Keleher, Pete; Sussman, Alan: Exploiting multi-core nodes in peer-to-peer grids (2014)
  5. Abed, Khalid H.; Morris, Gerald R.: Improving performance of codes with large/irregular stride memory access patterns via high performance reconfigurable computers (2013)
  6. Collier, Nathan; Dalcin, Lisandro; Pardo, David; Calo, V.M.: The cost of continuity: performance of iterative solvers on isogeometric finite elements (2013)
  7. Wu, Xingfu; Taylor, Valerie: Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers (2013)
  8. Habich, J.; Zeiser, T.; Hager, G.; Wellein, G.: Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA (2011)
  9. Heinzl, René; Schwaha, Philipp: A generic topology library (2011)
  10. Götz, J.; Iglberger, K.; Feichtinger, C.; Donath, S.; Rüde, U.: Coupling multibody dynamics and computational fluid dynamics on 8192 processor cores (2010)
  11. Strohmaier, Erich: Generalized utility metrics for supercomputers (2009)
  12. Saini, Subhash; Ciotti, Robert; Gunney, Brian T.N.; Spelce, Thomas E.; Koniges, Alice; Dossa, Don; Adamidis, Panagiotis; Rabenseifner, Rolf; Tiyyagura, Sunil R.; Mueller, Matthias: Performance evaluation of supercomputers using HPCC and IMB benchmarks (2008)
  13. Zeiser, T.; Wellein, G.; Nitsure, A.; Iglberger, K.; Rüde, U.; Hager, G.: Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method (2008)
  14. an Mey, Dieter; Sarholz, Samuel; Terboven, Christian: Nested parallelization with OpenMP (2007)
  15. Baker, A.H.; Dennis, J.M.; Jessup, E.R.: On improving linear solver performance: a block variant of GMRES (2006)
  16. Nakajima, Kengo: Three-level hybrid vs. flat MPI on the Earth Simulator: parallel iterative solvers for finite-element method (2005)
  17. Brown, P. N.; Chang, B.; Hanebutte, U. R.; Woodward, C. S.: The quest for a high-performance Boltzmann transport solver (2000)