• UNITY

  • Referenced in 184 articles [sw13461]
  • Warp protocol on a distributed memory parallel architecture...
  • FEAST

  • Referenced in 78 articles [sw04025]
  • performance, robustness, accuracy, and scalability on parallel architectures. This general purpose FEAST solver package includes...
  • BSPlib

  • Referenced in 42 articles [sw03374]
  • Portable and architecture independent parallel performance tuning using BSP. A call-graph profiling tool ... providing a mechanism for portable and architecture-independent parallel performance tuning. In order to test ... investigated on a number of different parallel architectures...
  • LAPACK

  • Referenced in 1626 articles [sw00503]
  • efficiently on shared-memory vector and parallel processors. On these machines, LINPACK and EISPACK ... block operations can be optimized for each architecture to account for the memory hierarchy...
  • TAO

  • Referenced in 43 articles [sw10597]
  • large optimization problems on high-performance parallel architectures. Our case study uses the GPCG (gradient...
  • Quantum Espresso

  • Referenced in 43 articles [sw06129]
  • with special attention paid to massively parallel architectures, and a great effort being devoted...
  • SuiteSparseQR

  • Referenced in 27 articles [sw07348]
  • obtain high performance on multicore architectures. Parallelism across different frontal matrices is handled with Intel...
  • DSDP5

  • Referenced in 26 articles [sw04411]
  • scalable performance for large problems on parallel architectures, and a well-documented interface and examples...
  • P - ARPACK

  • Referenced in 21 articles [sw09265]
  • scale eigenvalue package for distributed memory parallel architectures. P_ARPACK is a parallel version...
  • PFFT

  • Referenced in 16 articles [sw07582]
  • extension of FFTW to massively parallel architectures. We present an MPI based software library ... Fourier transforms (FFTs) on massively parallel, distributed memory architectures based on the message passing interface ... established transpose FFT algorithms, we propose a parallel FFT framework that is based ... pruned FFTs more efficiently on distributed memory architectures. For example, we provide performance measurements...
  • PFLOTRAN

  • Referenced in 23 articles [sw13428]
  • designed to run on massively parallel computing architectures as well as workstations and laptops. Parallelization...
  • ZRAM

  • Referenced in 38 articles [sw01038]
  • parallel computers of a variety of common architectures. This paper presents ZRAM, a portable parallel...
  • TELEMAC

  • Referenced in 10 articles [sw07470]
  • efficient hydrodynamics suite for massively parallel architectures. This paper investigates the use of TELEMAC ... Element-based hydrodynamics suite) on massively parallel computer architectures. The performance of TELEMAC is illustrated...
  • PIM

  • Referenced in 12 articles [sw08222]
  • modified for their efficient use on parallel architectures with either shared or distributed memory ... Results are presented for a variety of parallel computers...
  • EVPI

  • Referenced in 18 articles [sw02644]
  • multistage stochastic linear programmes on parallel MIMD architectures. Multistage stochastic linear programming has many practical ... problem. This paper describes a parallel implementation of the nested Benders algorithm which employs...
  • Exa-Dune

  • Referenced in 10 articles [sw32962]
  • exascale systems exhibiting a heterogeneous massively parallel architecture. In order to cope with the increased ... facilitated by exploiting massive coarse grained parallelism offered by multiscale and uncertainty quantification methods where...
  • M3D-C

  • Referenced in 10 articles [sw09200]
  • options are currently being updated for parallel architecture. Recent tokamak studies conducted with M3D have...
  • SPIKE

  • Referenced in 36 articles [sw02780]
  • linear solver SPIKE is proposed as a parallel environment for solving banded systems that ... system and the architecture of the high-end parallel computing platform. Numerical experiments are presented...
  • PLASMA

  • Referenced in 44 articles [sw12743]
  • Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) project aims to address the critical...
  • Nektar++

  • Referenced in 69 articles [sw11964]
  • Exploiting batch processing on streaming architectures to solve 2D elliptic finite element problems: a hybridized ... local matrix generation stage coupled with the parallelization techniques developed for the linear system solvers ... good candidate for implementation on streaming architectures such as modern graphical processing units (GPUs ... method amenable to the fine-grained parallelism of GPUs. We demonstrate that the HDG method...