PFFT: An extension of FFTW to massively parallel architectures. We present an MPI based software library for computing fast Fourier transforms (FFTs) on massively parallel, distributed memory architectures based on the message passing interface standard (MPI). Similar to established transpose FFT algorithms, we propose a parallel FFT framework that is based on a combination of local FFTs, local data permutations, and global data transpositions. This framework can be generalized to arbitrary multidimensional data and process meshes. All performance-relevant building blocks can be implemented with the help of the FFTW software library. Therefore, our library offers great flexibility and portable performance. Similarly to FFTW, we are able to compute FFTs of complex data, real data, and even- or odd-symmetric real data. All the transforms can be performed completely in place. Furthermore, we propose an algorithm to calculate pruned FFTs more efficiently on distributed memory architectures. For example, we provide performance measurements of FFTs of sizes between $512^3$ and $8192^3$ up to 262144 cores on a BlueGene/P architecture, up to 32768 cores on a BlueGene/Q architecture, and up to 4096 cores on the J”ulich Research on Petaflop Architectures (JuRoPA).

References in zbMATH (referenced in 20 articles , 1 standard article )

Showing results 1 to 20 of 20.
Sorted by year (citations)

  1. Bauer, Martin; Eibl, Sebastian; Godenschwager, Christian; Kohl, Nils; Kuron, Michael; Rettinger, Christoph; Schornbaum, Florian; Schwarzmeier, Christoph; Thönnes, Dominik; Köstler, Harald; Rüde, Ulrich: \textscwaLBerla: a block-structured high-performance framework for multiphysics simulations (2021)
  2. Caprace, Denis-Gabriel; Gillis, Thomas; Chatelain, Philippe: FLUPS: a Fourier-based library of unbounded Poisson solvers (2021)
  3. Dudal, David; Oliveira, Orlando; Roelfs, Martin; Silva, Paulo: Spectral representation of lattice gluon and ghost propagators at zero temperature (2020)
  4. Kwon, Oh-Kyoung; Lee, Jin; Lee, Junghoon; Kang, Ji-Hoon; Choi, Jung-Il: MPI parallel implementation for pseudo-spectral simulations for turbulent channel flow (2020)
  5. Gander, Martin J.; Wu, Shu-Lin: Convergence analysis of a \textitperiodic-like waveform relaxation method for initial-value problems via the diagonalization technique (2019)
  6. Jaber J. Hasbestan, Inanc Senocak: PittPack: An Open-Source Poisson’s Equation Solver for Extreme-Scale Computing with Accelerators (2019) arXiv
  7. Ashwin Vishnu Mohanan, Cyrille Bonamy, Pierre Augier: FluidFFT: common API (C++ and Python) for Fast Fourier Transform HPC libraries (2018) arXiv
  8. Osborn, Sarah; Zulian, Patrick; Benson, Thomas; Villa, Umberto; Krause, Rolf; Vassilevski, Panayot S.: Scalable hierarchical PDE sampler for generating spatially correlated random fields using nonmatching meshes. (2018)
  9. Plonka, Gerlind; Potts, Daniel; Steidl, Gabriele; Tasche, Manfred: Numerical Fourier analysis (2018)
  10. Hofmann, Michael; Nestler, Franziska; Pippig, Michael: NFFT based Ewald summation for electrostatic systems with charges and dipoles (2017)
  11. Lu, Jianfeng; Yang, Haizhao: Preconditioning orbital minimization method for planewave discretization (2017)
  12. Gholami, Amir; Malhotra, Dhairya; Sundar, Hari; Biros, George: FFT, FMM, or multigrid? A comparative study of state-of-the-art Poisson solvers for uniform and nonuniform grids in the unit cube (2016)
  13. Jung, Jaewoon; Kobayashi, Chigusa; Imamura, Toshiyuki; Sugita, Yuji: Parallel implementation of 3D FFT with volumetric decomposition schemes for efficient molecular dynamics simulations (2016)
  14. Mortensen, Mikael; Langtangen, Hans Petter: High performance python for direct numerical simulations of turbulent flows (2016)
  15. Fuka, V.: PoisFFT -- a free parallel fast Poisson solver (2015)
  16. Nestler, Franziska; Pippig, Michael; Potts, Daniel: Fast ewald summation based on NFFT with mixed periodicity (2015)
  17. Wiens, Jeffrey K.; Stockie, John M.: An efficient parallel immersed boundary algorithm using a pseudo-compressible fluid solver (2015)
  18. Browne, P. A.; Budd, C. J.; Piccolo, C.; Cullen, M.: Fast three dimensional r-adaptive mesh redistribution (2014)
  19. Pippig, Michael: PFFT: An extension of FFTW to massively parallel architectures (2013)
  20. Pippig, Michael; Potts, Daniel: Parallel three-dimensional nonequispaced fast Fourier transforms and their application to particle simulation (2013)