• PLAPACK

  • Referenced in 61 articles [sw04268]
  • parallel implementation of linear algebra algorithms and applications on distributed memory supercomputers such ... natural approach to encoding so-called blocked algorithms, which achieve high performance by operating ... centric approach to data distribution, sets PLAPACK apart from other parallel linear algebra libraries, allowing...
  • ParaDisEO

  • Referenced in 41 articles [sw01948]
  • including evolutionary algorithms (EA), local searches (LS), the most common parallel and distributed models...
  • PARDISO

  • Referenced in 255 articles [sw00679]
  • systems of equations on shared-memory and distributed-memory multiprocessors. The solver has been ... Parallel on SMPs and Cluster of SMPs. Automatic combination of iterative and direct solver algorithms...
  • Reduze

  • Referenced in 69 articles [sw10354]
  • include the distributed reduction of single topologies on multiple processor cores. The parallel reduction ... system. Fast graph and matroid based algorithms allow for the identification of equivalent topologies...
  • PUMMA

  • Referenced in 11 articles [sw07819]
  • PUMMA: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers. The paper describes Parallel ... Universal Matrix Multiplication Algorithms (PUMMA) on distributed memory concurrent computers. The PUMMA package includes ... block cyclic data distribution. The routines perform efficiently for a wide range of processor configurations ... BLAS routine xGEMM. Details of the parallel implementation of the routines are given, and results...
  • SPIRAL

  • Referenced in 46 articles [sw00903]
  • Processing (DSP) algorithms, in particular fast transform algorithms such as the fast Fourier transform. SPIRAL ... platforms including SSE, multicore, Cell, GPU, distributed memory parallel processors, and FPGA, and has produced ... some of the fastest implementations of these algorithms on these platforms (SPIRAL is used...
  • PNFFT

  • Referenced in 8 articles [sw07583]
  • parallel algorithm for calculating nonequispaced fast Fourier transforms on massively parallel distributed memory architectures ... serial algorithm due to the use of oversampled FFT. This algorithm has been implemented ... Furthermore, we derive a new parallel distributed memory algorithm for the fast computation of fully ... that an appropriate adjustment of the underlying parallel nonequispaced fast Fourier transform circumvents severe load...
  • GADGET

  • Referenced in 41 articles [sw12701]
  • massively parallel supercomputers with distributed memory. While both versions use a tree algorithm to compute ... this study, we detail the numerical algorithms employed, and show various tests of the code ... release both the serial and the massively parallel version of the code...
  • GridSim

  • Referenced in 49 articles [sw01392]
  • modeling and simulation of entities in parallel and distributed computing (PDC) systems: users, applications, resources ... schedulers) for design and evaluation of scheduling algorithms. It provides a comprehensive facility for creating...
  • XGBoost

  • Referenced in 40 articles [sw21035]
  • XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible ... machine learning algorithms under the Gradient Boosting framework. XGBoost provides a parallel tree boosting (also ... same code runs on major distributed environment (Hadoop, SGE, MPI) and can solve problems beyond...
  • PSPIKE

  • Referenced in 14 articles [sw07072]
  • symmetric systems, real, parallel on distributed-memory clusters, combinatorial graph algorithms...
  • PBGL

  • Referenced in 8 articles [sw04182]
  • High-Performance Parallel and Distributed Graph Computation. The Parallel BGL builds on the Boost Graph ... offering similar data structures, algorithms, and syntax for distributed, parallel computation that the BGL offers ... both experimentation with and comparison of parallel graph algorithms and to provide solid implementations...
  • EVPI

  • Referenced in 18 articles [sw02644]
  • between processors. A parallel version of a sequential importance sampling solution algorithm based on local ... continuous distribution of possible realisations. It utilises the parallel nested Benders algorithm and a parallel...
  • Sips

  • Referenced in 9 articles [sw20921]
  • matrices and the pathological eigenvalue distribution challenge the efficiency and robustness of the solver ... article, we present a parallel eigenvalue algorithm based on distributed spectrum slicing. We describe...
  • GloMoSim

  • Referenced in 60 articles [sw13764]
  • include the null message and conditional event algorithms. The paper describes the GloMoSim library, addresses ... parallelization, and presents a set of experimental results on the IBM 9076 SP, a distributed...
  • SIMGRID

  • Referenced in 26 articles [sw10566]
  • parallel applications over increasingly large sets of distributed resources. Consequently, the study of scheduling algorithms...
  • ParaKMeans

  • Referenced in 4 articles [sw29731]
  • ParaKMeans: Implementation of a parallelized K-means algorithm suitable for general laboratory use. Background: During ... perform cluster analysis. While many clustering algorithms have been developed, they all suffer a significant ... clustering algorithms is to distribute or parallelize the algorithm across multiple computers. Results: The software...
  • Petabricks

  • Referenced in 5 articles [sw23582]
  • Choices also include different automatic parallelization techniques, data distributions, algorithmic parameters, transformations, and blocking...
  • PFFT

  • Referenced in 14 articles [sw07582]
  • fast Fourier transforms (FFTs) on massively parallel, distributed memory architectures based on the message passing ... Similar to established transpose FFT algorithms, we propose a parallel FFT framework that is based ... propose an algorithm to calculate pruned FFTs more efficiently on distributed memory architectures. For example...
  • PDAC

  • Referenced in 5 articles [sw02128]
  • closed queueing networks. A parallel distribution analysis by chain algorithm (PDAC) is presented ... class queueing networks. The PDAC algorithm uses data parallel computation of the summation indices needed...
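
Several entries above (PUMMA in particular) refer to a block cyclic data distribution. As a rough illustration only, the following Python sketch shows the usual two-dimensional block-cyclic mapping from a global matrix index to the owning process. The function names, the square block size, and the zero-based process grid are assumptions made for this example and are not taken from PUMMA or any other listed package.

```python
def block_cyclic_owner(i, j, block_size, proc_rows, proc_cols):
    """Return the (row, col) grid coordinates of the process that owns
    global matrix entry (i, j) under a 2D block-cyclic distribution.

    Hypothetical helper: assumes square blocks and a zero-based grid.
    """
    # Blocks are dealt out round-robin along each dimension.
    return (i // block_size) % proc_rows, (j // block_size) % proc_cols


def local_index(g, block_size, nprocs):
    """Map a global index g along one dimension to the local index
    on the owning process (hypothetical helper)."""
    block = g // block_size        # global block number
    local_block = block // nprocs  # how many earlier blocks this owner holds
    return local_block * block_size + g % block_size


if __name__ == "__main__":
    # Entry (2, 4) with 2x2 blocks on a 2x3 process grid.
    print(block_cyclic_owner(2, 4, block_size=2, proc_rows=2, proc_cols=3))
    print(local_index(5, block_size=2, nprocs=2))
```

Dealing blocks out cyclically, rather than giving each process one contiguous slab, is what keeps the work balanced as an algorithm moves across the matrix.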