
PNFFT
 Referenced in 5 articles
[sw07583]
 Parallel three-dimensional nonequispaced fast Fourier transforms and their application to particle simulation Starting from ... algorithm for calculating nonequispaced fast Fourier transforms on massively parallel distributed memory architectures. We demonstrate ... derive a new parallel distributed memory algorithm for the fast computation of fully Coulomb interactions ... appropriate adjustment of the underlying parallel nonequispaced fast Fourier transform circumvents severe load imbalance...

PFFT
 Referenced in 8 articles
[sw07582]
 software library for computing fast Fourier transforms (FFTs) on massively parallel, distributed memory architectures based...

SPIRAL
 Referenced in 38 articles
[sw00903]
 fast transform algorithms such as the fast Fourier transform. SPIRAL is capable of generating optimized ... including SSE, multicore, Cell, GPU, distributed memory parallel processors, and FPGA, and has produced some ... implementations and uses intelligent search to find fast implementations. This talk provides an overview...

SWIFFT
 Referenced in 32 articles
[sw11588]
 novel use of the Fast Fourier Transform (FFT) to achieve “diffusion,” together with a linear ... performance software implementation that exploits the inherent parallelism of the FFT algorithm. The throughput...

P3DFFT
 Referenced in 7 articles
[sw06503]
 Fourier transforms in three dimensions Fourier and related transforms are a family of algorithms widely ... notoriously difficult to scale on high-performance parallel computers with a large number of processing ... software package called P3DFFT which implements fast Fourier transforms (FFTs) in three dimensions...

cuFFT
 Referenced in 17 articles
[sw11258]
 document describes cuFFT, the NVIDIA® CUDA™ Fast Fourier Transform (FFT) product. It consists ... conquer algorithm for efficiently computing discrete Fourier transforms of complex or real-valued data sets ... quickly leverage the floating-point power and parallelism of the GPU in a highly optimized...

VSIPL++
 Referenced in 2 articles
[sw14039]
 foundations and expected performance of the library. Parallel VSIPL++ supports adaptive optimization at many levels ... compiler. The computation objects (e.g., fast Fourier transforms) are built with explicit setup ... stages to allow for runtime optimization. Parallel arrays and functions in Parallel VSIPL++ also support...

k-Wave
 Referenced in 8 articles
[sw07387]
 planar detector geometry based on the fast Fourier transform (FFT) is also included. The architecture ... optimization of computational speed is demonstrated through parallel execution using a graphics processing unit...

DL_POLY_3
 Referenced in 3 articles
[sw09141]
 embedding a highly efficient domain decomposition (DD) parallelization strategy. It was developed at Daresbury Laboratory ... simulations, incorporating a novel three-dimensional fast Fourier transform (the Daresbury Advanced Fourier Transform), makes...

Pescan
 Referenced in 1 article
[sw20134]
 parallel energy scan (PESCAN) code. Pescan is used for non-self-consistent calculations of electron and hole ... plane wave basis set. Fast Fourier transform (FFT) is used to transform the wavefunction from...

ANSYS
 Referenced in 404 articles
[sw00044]
 ANSYS offers a comprehensive software suite that spans...

ATLAS
 Referenced in 182 articles
[sw00056]
 This paper describes the Automatically Tuned Linear Algebra...

ACL2
 Referenced in 155 articles
[sw00060]
 ACL2 is both a programming language in which...

BiCG
 Referenced in 34 articles
[sw00076]
 BiCG: An effective solver for three fields...

Expokit
 Referenced in 106 articles
[sw00258]
 Expokit provides a set of routines aimed at...

FGb
 Referenced in 216 articles
[sw00286]
 FGb/Gb library. Gb is a program (191 420 lines...

GAP
 Referenced in 1803 articles
[sw00320]
 GAP is a system for computational discrete algebra...

gmp
 Referenced in 164 articles
[sw00363]
 GMP is a free library for arbitrary precision...

LAPACK
 Referenced in 1222 articles
[sw00503]
 LAPACK is written in Fortran 90 and provides...