References in zbMATH (referenced in 19 articles )

Showing results 1 to 19 of 19.
Sorted by year (citations)

  1. Moxey, David; Amici, Roman; Kirby, Mike: Efficient matrix-free high-order finite element evaluation for simplicial elements (2020)
  2. Nikitenko, D. A.; Shvets, P. A.; Voevodin, V. V.: Why do users need to take care of their HPC applications efficiency? (2020)
  3. Bukata, Libor; Šucha, Přemysl; Hanzálek, Zdeněk: Optimizing energy consumption of robotic cells by a branch & bound algorithm (2019)
  4. Bauer, S.; Drzisga, D.; Mohr, M.; Rüde, U.; Waluga, C.; Wohlmuth, B.: A stencil scaling approach for accelerating matrix-free finite element implementations (2018)
  5. Elafrou, Athena; Karakasis, Vasileios; Gkountouvas, Theodoros; Kourtis, Kornilios; Goumas, Georgios; Koziris, Nectarios: SparseX: a library for high-performance sparse matrix-vector multiplication on multicore platforms (2018)
  6. Schoeder, S.; Kormann, K.; Wall, W. A.; Kronbichler, M.: Efficient explicit time stepping of high order discontinuous Galerkin schemes for waves (2018)
  7. Towara, M.; Naumann, U.: SIMPLE adjoint message passing (2018)
  8. Wittmann, M.; Haag, V.; Zeiser, T.; Köstler, H.; Wellein, G.: Lattice Boltzmann benchmark kernels as a testbed for performance analysis (2018)
  9. Agrawal, Tarun K.; Sahu, Aryabartta; Ghose, Manojit; Sharma, R.: Scheduling chained multiprocessor tasks onto large multiprocessor system (2017)
  10. Bird, R. E.; Coombs, W. M.; Giani, S.: Fast native-MATLAB stiffness assembly for SIPG linear elasticity (2017)
  11. Pieper, Andreas; Kreutzer, Moritz; Alvermann, Andreas; Galgon, Martin; Fehske, Holger; Hager, Georg; Lang, Bruno; Wellein, Gerhard: High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations (2016)
  12. Malas, T.; Hager, G.; Ltaief, H.; Stengel, H.; Wellein, G.; Keyes, D.: Multicore-optimized wavefront diamond blocking for optimizing stencil updates (2015)
  13. Röhrig-Zöllner, Melven; Thies, Jonas; Kreutzer, Moritz; Alvermann, Andreas; Pieper, Andreas; Basermann, Achim; Hager, Georg; Wellein, Gerhard; Fehske, Holger: Increasing the performance of the Jacobi-Davidson method by blocking (2015)
  14. de la Cruz, Raúl; Araya-Polo, Mauricio: Algorithm 942: Semi-stencil (2014)
  15. Habich, J.; Feichtinger, C.; Köstler, H.; Hager, G.; Wellein, G.: Performance engineering for the lattice Boltzmann method on GPGPUs: architectural requirements and performance results (2013) ioport
  16. Sanchez, Luis Miguel; Fernandez, Javier; Sotomayor, Rafael; Escolar, Soledad; Garcia, J. Daniel: A comparative study and evaluation of parallel programming models for shared-memory parallel architectures (2013) ioport
  17. Klöfkorn, Robert: Efficient matrix-free implementation of discontinuous Galerkin methods for compressible flow problems (2012)
  18. Treibig, Jan; Hager, Georg; Wellein, Gerhard: LIKWID: lightweight performance tools (2011) ioport
  19. Treibig, Jan; Hager, Georg; Wellein, Gerhard: LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments (2010) ioport