PATUS
PATUS: A code generation and autotuning framework for parallel iterative stencil computations on modern microarchitectures. Stencil calculations comprise an important class of kernels in many scientific computing applications ranging from simple PDE solvers to constituent kernels in multigrid methods as well as image processing applications. In such types of solvers, stencil kernels are often the dominant part of the computation, and an efficient parallel implementation of the kernel is therefore crucial in order to reduce the time to solution. However, in the current complex hardware micro architectures, meticulous architecture-specific tuning is required to elicit the machine’s full compute power. We present a code generation and auto-tuning framework textsc{Patus} for stencil computations targeted at multi- and many core processors, such as multicore CPUs and graphics processing units, which makes it possible to generate compute kernels from a specification of the stencil operation and a parallelization and optimization strategy, and leverages the auto tuning methodology to optimize strategy-dependent parameters for the given hardware architecture.
Keywords for this software
References in zbMATH (referenced in 8 articles )
Showing results 1 to 8 of 8.
Sorted by year (- Köstler, Harald; Heisig, Marco; Kohl, Nils; Kuckuk, Sebastian; Bauer, Martin; Rüde, Ulrich: Code generation approaches for parallel geometric multigrid solvers (2020)
- Gadioli, Davide; Vitali, Emanuele; Palermo, Gianluca; Silvano, Cristina: mARGOt: a dynamic autotuning framework for self-aware approximate computing (2019)
- Hückelheim, J. C.; Hovland, P. D.; Strout, M. M.; Müller, J.-D.: Parallelizable adjoint stencil computations using transposed forward-mode algorithmic differentiation (2018)
- Ghysels, Pieter; Vanroose, Wim: Modeling the performance of geometric multigrid stencils on multicore computer architectures (2015)
- Malas, T.; Hager, G.; Ltaief, H.; Stengel, H.; Wellein, G.; Keyes, D.: Multicore-optimized wavefront diamond blocking for optimizing stencil updates (2015)
- Mo, Tieqiang; Li, Renfa: A new memory mapping mechanism for GPGPUs’ stencil computation (2015)
- Hupp, Philipp; Jacob, Riko: Tight bounds for low dimensional star stencils in the external memory model (2013)
- Yang, Yang; Cui, Hui-Min; Feng, Xiao-Bing; Xue, Jing-Ling: A hybrid circular queue method for iterative stencil computations on GPUs (2012) ioport