Exploiting batch processing on streaming architectures to solve 2D elliptic finite element problems: a hybridized discontinuous Galerkin (HDG) case study. Numerical methods for elliptic partial differential equations (PDEs) within both continuous and hybridized discontinuous Galerkin (HDG) frameworks share the same general structure: local (elemental) matrix generation followed by a global linear system assembly and solve. The lack of inter-element communication and easily parallelizable nature of the local matrix generation stage coupled with the parallelization techniques developed for the linear system solvers make a numerical scheme for elliptic PDEs a good candidate for implementation on streaming architectures such as modern graphical processing units (GPUs). We propose an algorithmic pipeline for mapping an elliptic finite element method to the GPU and perform a case study for a particular method within the HDG framework. This study provides comparison between CPU and GPU implementations of the method as well as highlights certain performance-crucial implementation details. The choice of the HDG method for the case study was dictated by the computationally-heavy local matrix generation stage as well as the reduced trace-based communication pattern, which together make the method amenable to the fine-grained parallelism of GPUs. We demonstrate that the HDG method is well-suited for GPU implementation, obtaining total speedups on the order of 30-35 times over a serial CPU implementation for moderately sized problems.

References in zbMATH (referenced in 76 articles , 2 standard articles )

Showing results 1 to 20 of 76.
Sorted by year (citations)

1 2 3 4 next

  1. Jiang, Hongyi: Formation mechanism of a secondary vortex street in a cylinder wake (2021)
  2. Krais, Nico; Beck, Andrea; Bolemann, Thomas; Frank, Hannes; Flad, David; Gassner, Gregor; Hindenlang, Florian; Hoffmann, Malte; Kuhn, Thomas; Sonntag, Matthias; Munz, Claus-Dieter: FLEXI: a high order discontinuous Galerkin framework for hyperbolic-parabolic conservation laws (2021)
  3. Pintore, Moreno; Pichi, Federico; Hess, Martin; Rozza, Gianluigi; Canuto, Claudio: Efficient computation of bifurcation diagrams with a deflated approach to reduced basis spectral element method (2021)
  4. Tonicello, Niccolò; Lodato, Guido; Vervisch, Luc: A comparative study from spectral analyses of high-order methods with non-constant advection velocities (2021)
  5. Yan, Zhen-Guo; Pan, Yu; Castiglioni, Giacomo; Hillewaert, Koen; Peiró, Joaquim; Moxey, David; Sherwin, Spencer J.: Nektar++: design and implementation of an implicit, spectral/(hp) element, compressible flow solver using a Jacobian-free Newton Krylov approach (2021)
  6. Cheng, Liang; Ju, Xiaoying; Tong, Feifei; An, Hongwei: Transition to chaos through period doublings of a forced oscillating cylinder in steady current (2020)
  7. Gupta, Vikrant; He, Wei; Wan, Minping; Chen, Shiyi; Li, Larry K. B.: A Ginzburg-Landau model for linear global modes in open shear flows (2020)
  8. Ju, Xiaoying; An, Hongwei; Cheng, Liang; Tong, Feifei: Modes of synchronisation around a near-wall oscillating cylinder in streamwise directions (2020)
  9. Kumar, Abhishek; Pothérat, Alban: Mixed baroclinic convection in a cavity (2020)
  10. Matteo Giacomini, Ruben Sevilla, Antonio Huerta: HDGlab: An open-source implementation of the hybridisable discontinuous Galerkin method in MATLAB (2020) arXiv
  11. Moratilla-Vega, M. A.; Lackhove, K.; Janicka, J.; Xia, H.; Page, G. J.: Jet noise analysis using an efficient LES/high-order acoustic coupling method (2020)
  12. Moura, Rodrigo C.; Aman, Mansoor; Peiró, Joaquim; Sherwin, Spencer J.: Spatial eigenanalysis of spectral/\textithpcontinuous Galerkin schemes and their stabilisation via DG-mimicking spectral vanishing viscosity for high Reynolds number flows (2020)
  13. Moxey, David; Amici, Roman; Kirby, Mike: Efficient matrix-free high-order finite element evaluation for simplicial elements (2020)
  14. Nordström, Jan; Hagstrom, Thomas M.: The number of boundary conditions for initial boundary value problems (2020)
  15. Önder, Asim; Liu, Philip L.-F.: Stability of the solitary wave boundary layer subject to finite-amplitude disturbances (2020)
  16. Puligilla, Shivakanth Chary; Jayaraman, Balaji: Assessment of end-to-end and sequential data-driven learning for non-intrusive modeling of fluid flows (2020)
  17. Xiong, Chengwang; Qi, Xiang; Gao, Ankang; Xu, Hui; Ren, Chengjiao; Cheng, Liang: The bypass transition mechanism of the Stokes boundary layer in the intermittently turbulent regime (2020)
  18. Zhang, Kai; Hayostek, Shelby; Amitay, Michael; He, Wei; Theofilis, Vassilios; Taira, Kunihiko: On the formation of three-dimensional separated flows over wings under tip effects (2020)
  19. Cantwell, Chris D.; Nielsen, Allan S.: A minimally intrusive low-memory approach to resilience for existing transient solvers (2019)
  20. Cervi, Jessica; Spiteri, Raymond J.: A comparison of fourth-order operator splitting methods for cardiac simulations (2019)

1 2 3 4 next

Further publications can be found at: