References in zbMATH (referenced in 16 articles )

Showing results 1 to 16 of 16.
Sorted by year (citations)

  1. Chang, J.; Karra, S.; Nakshatrala, K.B.: Large-scale optimization-based non-negative computational framework for diffusion equations: parallel implementation and performance studies (2017)
  2. Abdelfattah, Ahmad; Keyes, David; Ltaief, Hatem: KBLAS: an optimized library for dense matrix-vector multiplication on GPU accelerators (2016)
  3. D’Ambra, Pasqua; Filippone, Salvatore: A parallel generalized relaxation method for high-performance image segmentation on GPUs (2016)
  4. Ghysels, Pieter; Li, Xiaoye S.; Rouet, François-Henry; Williams, Samuel; Napov, Artem: An efficient multicore implementation of a novel HSS-structured multifrontal solver using randomized sampling (2016)
  5. Kronbichler, M.; Schoeder, S.; Müller, C.; Wall, W.A.: Comparison of implicit and explicit hybridizable discontinuous Galerkin methods for the acoustic wave equation (2016)
  6. Ghysels, Pieter; Vanroose, Wim: Modeling the performance of geometric multigrid stencils on multicore computer architectures (2015)
  7. Malas, T.; Hager, G.; Ltaief, H.; Stengel, H.; Wellein, G.; Keyes, D.: Multicore-optimized wavefront diamond blocking for optimizing stencil updates (2015)
  8. Röhrig-Zöllner, Melven; Thies, Jonas; Kreutzer, Moritz; Alvermann, Andreas; Pieper, Andreas; Basermann, Achim; Hager, Georg; Wellein, Gerhard; Fehske, Holger: Increasing the performance of the Jacobi-Davidson method by blocking (2015)
  9. Lee, Jaehwan; Keleher, Pete; Sussman, Alan: Exploiting multi-core nodes in peer-to-peer grids (2014) ioport
  10. Abed, Khalid H.; Morris, Gerald R.: Improving performance of codes with large/irregular stride memory access patterns via high performance reconfigurable computers (2013) ioport
  11. Wu, Xingfu; Taylor, Valerie: Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers (2013)
  12. Kronbichler, Martin; Kormann, Katharina: A generic interface for parallel cell-based finite element operator application (2012)
  13. Schönherr, M.; Kucher, K.; Geier, M.; Stiebler, M.; Freudiger, S.; Krafczyk, M.: Multi-thread implementations of the lattice Boltzmann method on non-uniform grids for cpus and gpus (2011) ioport
  14. Götz, J.; Iglberger, K.; Feichtinger, C.; Donath, S.; Rüde, U.: Coupling multibody dynamics and computational fluid dynamics on 8192 processor cores (2010)
  15. Saini, Subhash; Ciotti, Robert; Gunney, Brian T.N.; Spelce, Thomas E.; Koniges, Alice; Dossa, Don; Adamidis, Panagiotis; Rabenseifner, Rolf; Tiyyagura, Sunil R.; Mueller, Matthias: Performance evaluation of supercomputers using HPCC and IMB benchmarks (2008)
  16. an Mey, Dieter; Sarholz, Samuel; Terboven, Christian: Nested parallelization with openMP (2007)