CUSPARSE

The CUSPARSE library contains a set of basic linear algebra subroutines used for handling sparse matrices. It is implemented on top of the NVIDIA® CUDA™ runtime (which is part of the CUDA Toolkit) and is designed to be called from C and C++. The library routines can be classified into four categories: Level 1: operations between a vector in sparse format and a vector in dense format; Level 2: operations between a matrix in sparse format and a vector in dense format; Level 3: operations between a matrix in sparse format and a set of vectors in dense format (which can also usually be viewed as a dense tall matrix); Conversion: operations that allow conversion between different matrix formats.


References in zbMATH (referenced in 14 articles )

Showing results 1 to 14 of 14.
Sorted by year (citations)

  1. Bernaschi, Massimo; Bisson, Mauro; Fantozzi, Carlo; Janna, Carlo: A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units (2016)
  2. D’Ambra, Pasqua; Filippone, Salvatore: A parallel generalized relaxation method for high-performance image segmentation on GPUs (2016)
  3. Gremse, Felix; Höfter, Andreas; Schwen, Lars Ole; Kiessling, Fabian; Naumann, Uwe: GPU-accelerated sparse matrix-matrix multiplication by iterative row merging (2015)
  4. Magoulès, Frédéric; Ahamed, Abal-Kassim Cheik; Putanowicz, Roman: Auto-tuned Krylov methods on cluster of graphics processing unit (2015)
  5. Mironowicz, P.; Dziekonski, A.; Mrozowski, M.: A task-scheduling approach for efficient sparse symmetric matrix-vector multiplication on a GPU (2015)
  6. Naumov, M.; Arsaev, M.; Castonguay, P.; Cohen, J.; Demouth, J.; Eaton, J.; Layton, S.; Markovskiy, N.; Reguly, I.; Sakharnykh, N.; Sellappan, V.; Strzodka, R.: AmgX: a library for GPU accelerated algebraic multigrid and preconditioned iterative methods (2015)
  7. Birk, Matthias; Dapp, Robin; Ruiter, N.V.; Becker, J.: GPU-based iterative transmission reconstruction in 3D ultrasound computer tomography (2014)
  8. Chang, Li-Wen; Hwu, Wen-Mei W.: A guide for implementing tridiagonal solvers on GPUs (2014)
  9. Gao, Jiaquan; Liang, Ronghua; Wang, Jun: Research on the conjugate gradient algorithm with a modified incomplete Cholesky preconditioner on GPU (2014)
  10. Koza, Zbigniew; Matyka, Maciej; Mirosław, Łukasz; Poła, Jakub: Sparse matrix-vector product (2014)
  11. Demidov, Denis; Ahnert, Karsten; Rupp, Karl; Gottschling, Peter: Programming CUDA and OpenCL: a case study using modern C++ libraries (2013)
  12. Knepley, Matthew G.; Terrel, Andy R.: Finite element integration on GPGPUs (2013)
  13. Galiano, V.; Migallón, H.; Migallón, V.; Penadés, J.: GPU-based parallel algorithms for sparse nonlinear systems (2012)
  14. Oberhuber, Tomáš; Heller, Martin: Improved row-grouped CSR format for storing of sparse matrices on GPU (2012)