• LOCA

  • Referenced in 33 articles [sw04717]
  • differential equations, and to run on distributed memory parallel machines. The approach in LOCA...
  • PFFT

  • Referenced in 20 articles [sw07582]
  • fast Fourier transforms (FFTs) on massively parallel, distributed memory architectures based on the message passing ... established transpose FFT algorithms, we propose a parallel FFT framework that is based ... calculate pruned FFTs more efficiently on distributed memory architectures. For example, we provide performance measurements...
  • ALPS

  • Referenced in 19 articles [sw08907]
  • porting a serial code onto a parallel, distributed memory machine. Major changes in release...
  • XGBoost

  • Referenced in 139 articles [sw21035]
  • XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible ... Gradient Boosting framework. XGBoost provides a parallel tree boosting (also known as GBDT, GBM) that ... same code runs on major distributed environments (Hadoop, SGE, MPI) and can solve problems beyond...
  • Pluto

  • Referenced in 34 articles [sw09092]
  • Chombo library which provides a distributed infrastructure for parallel calculations over block-structured, adaptively refined...
  • ParaSCIP

  • Referenced in 32 articles [sw06292]
  • SCIP, which realizes a parallelization on a distributed memory computing environment. ParaSCIP uses SCIP solvers ... branching tree) locally. This makes the parallelization development independent of the SCIP development. Thus, ParaSCIP...
  • FLICA-OVAP

  • Referenced in 16 articles [sw18356]
  • research purpose. The architecture also enables distributed parallel calculations, multidisciplinary couplings (with the neutronics codes...
  • SCALEA

  • Referenced in 16 articles [sw02477]
  • SCALEA: A performance analysis tool for distributed and parallel programs. In this paper we present ... measurement, analysis, and visualization tool for parallel and distributed programs that supports post-mortem...
  • PSP

  • Referenced in 34 articles [sw07589]
  • preconditioner for heterogeneous 3D Helmholtz equations. A parallelization of a sweeping preconditioner for three-dimensional ... counts are reported for high-frequency problems distributed over thousands of cores. Two open-source ... with this paper: parallel sweeping preconditioner (PSP) and the underlying distributed multifrontal solver, clique...
  • clique

  • Referenced in 34 articles [sw07590]
  • preconditioner for heterogeneous 3D Helmholtz equations. A parallelization of a sweeping preconditioner for three-dimensional ... counts are reported for high-frequency problems distributed over thousands of cores. Two open-source ... with this paper: parallel sweeping preconditioner (PSP) and the underlying distributed multifrontal solver, clique...
  • MiBench

  • Referenced in 50 articles [sw04421]
  • SPEC benchmarks including instruction distribution, memory behavior, and available parallelism. The embedded benchmarks, called MiBench...
  • Elemental

  • Referenced in 25 articles [sw07035]
  • Framework for Distributed Memory Dense Matrix Computations. Parallelizing dense matrix computations to distributed memory architectures ... among the best understood domains of parallel computing. Two packages, developed in the mid 1990s ... very well take the shape of distributed memory architectures within a single processor, these packages...
  • PBGL

  • Referenced in 11 articles [sw04182]
  • Generic C++ Library for High-Performance Parallel and Distributed Graph Computation The Parallel BGL builds ... data structures, algorithms, and syntax for distributed, parallel computation that the BGL offers for sequential...
  • PARDISO

  • Referenced in 295 articles [sw00679]
  • systems of equations on shared-memory and distributed-memory multiprocessors. The solver has been ... indefinite, Hermitian. LU with complete pivoting. Parallel on SMPs and Cluster of SMPs. Automatic combination...
  • P_ARPACK

  • Referenced in 21 articles [sw09265]
  • portable large scale eigenvalue package for distributed memory parallel architectures. P_ARPACK is a parallel ... parallel implementation of ARPACK is presented which is portable across a wide range of distributed...
  • pARMS

  • Referenced in 39 articles [sw00683]
  • scientific and engineering applications. The most common parallel preconditioners used for sparse linear systems adapt ... more general framework of “distributed sparse linear systems”. The parallel Algebraic Recursive Multilevel Solver (pARMS...
  • AmgX

  • Referenced in 14 articles [sw13440]
  • large that they require large scale distributed parallel computing to obtain the solution of interest ... which provides drop-in GPU acceleration of distributed algebraic multigrid (AMG) and preconditioned iterative methods ... available multigrid methods or simpler preconditioners. The parallelism in the aggregation scheme exploits parallel graph...
  • IBAMR

  • Referenced in 11 articles [sw12603]
  • IBAMR: An adaptive and distributed-memory parallel implementation of the immersed boundary method. IBAMR: IBAMR ... distributed-memory parallel implementation of the immersed boundary (IB) method with support for Cartesian grid ... adaptive mesh refinement (AMR). Support for distributed-memory parallelism is via MPI, the Message Passing...
  • TAU

  • Referenced in 22 articles [sw10173]
  • pace with the growing complexity of parallel and distributed systems depends on robust performance frameworks ... presents the TAU (Tuning and Analysis Utilities) parallel performance system and describes how it addresses...
  • Dryad

  • Referenced in 21 articles [sw08916]
  • Dryad: distributed data-parallel programs from sequential building blocks. Dryad is a general-purpose distributed ... execution engine for coarse-grain data-parallel applications. A Dryad application combines computational “vertices” with ... difficult problems of creating a large distributed, concurrent application: scheduling the use of computers...