
PLAPACK
 Referenced in 61 articles
[sw04268]
 parallel implementation of linear algebra algorithms and applications on distributed memory supercomputers such ... natural approach to encoding socalled blocked algorithms, which achieve high performance by operating ... centric approach to data distribution, sets PLAPACK apart from other parallel linear algebra libraries, allowing...

ParaDisEO
 Referenced in 41 articles
[sw01948]
 including evolutionary algorithms (EA), local searches (LS), the most common parallel and distributed models...

PARDISO
 Referenced in 255 articles
[sw00679]
 systems of equations on sharedmemory and distributedmemory multiprocessors. The solver has has been ... Parallel on SMPs and Cluster of SMPs. Automatic combination of iterative and direct solver algorithms...

Reduze
 Referenced in 69 articles
[sw10354]
 include the distributed reduction of single topologies on multiple processor cores. The parallel reduction ... system. Fast graph and matroid based algorithms allow for the identification of equivalent topologies...

PUMMA
 Referenced in 11 articles
[sw07819]
 PUMMA: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers. he paper describes Parallel ... Universal Matrix Multiplication Algorithms (PUMMA) on distributed memory concurrent computers. The PUMMA package includes ... block cyclic data distribution. The routines perform efficiently for a wide range of processor configurations ... BLAS routine xGEMM. Details of the parallel implementation of the routines are given, and results...

SPIRAL
 Referenced in 46 articles
[sw00903]
 Processing (DSP) algorithms, in particular fast transform algorithms such as the fast Fourier transform. SPIRAL ... platforms including SSE, multicore, Cell, GPU, distributed memory parallel processors, and FPGA, and has produced ... some of the fastest implementations of these algorithms on these platforms (SPIRAL is used...

PNFFT
 Referenced in 8 articles
[sw07583]
 parallel algorithm for calculating nonequispaced fast Fourier transforms on massively parallel distributed memory architectures ... serial algorithm due to the use of oversampled FFT. This algorithm has been implemented ... Furthermore, we derive a new parallel distributed memory algorithm for the fast computation of fully ... that an appropriate adjustment of the underlying parallel nonequispaced fast Fourier transform circumvents severe load...

GADGET
 Referenced in 41 articles
[sw12701]
 massively parallel supercomputers with distributed memory. While both versions use a tree algorithm to compute ... this study, we detail the numerical algorithms employed, and show various tests of the code ... release both the serial and the massively parallel version of the code...

GridSim
 Referenced in 49 articles
[sw01392]
 modeling and simulation of entities in parallel and distributed computing (PDC) systemsusers, applications, resources ... schedulers) for design and evaluation of scheduling algorithms. It provides a comprehensive facility for creating...

XGBoost
 Referenced in 40 articles
[sw21035]
 XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible ... machine learning algorithms under the Gradient Boosting framework. XGBoost provides a parallel tree boosting (also ... same code runs on major distributed environment (Hadoop, SGE, MPI) and can solve problems beyond...

PSPIKE
 Referenced in 14 articles
[sw07072]
 symmetric systems, real, parallel on distributedmemory clusters, combinatorial graph algorithms...

PBGL
 Referenced in 8 articles
[sw04182]
 HighPerformance Parallel and Distributed Graph Computation The Parallel BGL builds on the Boost Graph ... offering similar data structures, algorithms, and syntax for distributed, parallel computation that the BGL offers ... both experimentation with and comparison of parallel graph algorithms and to provide solid implementations...

EVPI
 Referenced in 18 articles
[sw02644]
 between processors. A parallel version of a sequential importance sampling solution algorithm based on local ... continuous distribution of possible realisations. It utilises the parallel nested Benders algorithm and a parallel...

Sips
 Referenced in 9 articles
[sw20921]
 matrices and the pathological eigenvalue distribution challenge the efficiency and robustness of the solver ... article, we present a parallel eigenvalue algorithm based on distributed spectrum slicing. We describe...

GloMoSim
 Referenced in 60 articles
[sw13764]
 include the null message and conditional event algorithms. The paper describes the GloMoSim library, addresses ... parallelization, and presents a set of experimental results on the IBM 9076 SP, a distributed...

SIMGRID
 Referenced in 26 articles
[sw10566]
 parallel applications over increasingly large sets of distributed resources. Consequently, the study of scheduling algorithms...

ParaKMeans
 Referenced in 4 articles
[sw29731]
 ParaKMeans: Implementation of a parallelized Kmeans algorithm suitable for general laboratory use. Background: During ... perform cluster analysis. While many clustering algorithms have been developed, they all suffer a significant ... clustering algorithms is to distribute or parallelize the algorithm across multiple computers. Results: The software...

Petabricks
 Referenced in 5 articles
[sw23582]
 Choices also include different automatic parallelization techniques, data distributions, algorithmic parameters, transformations, and blocking...

PFFT
 Referenced in 14 articles
[sw07582]
 fast Fourier transforms (FFTs) on massively parallel, distributed memory architectures based on the message passing ... Similar to established transpose FFT algorithms, we propose a parallel FFT framework that is based ... propose an algorithm to calculate pruned FFTs more efficiently on distributed memory architectures. For example...

PDAC
 Referenced in 5 articles
[sw02128]
 closed queueing networks A parallel distribution analysis by chain algorithm (PDAC) is presented ... class queueing networks. The PDAC algorithm uses data parallel computation of the summation indices needed...