ScaLAPACK

ScaLAPACK is an acronym for scalable linear algebra package or scalable LAPACK. It is a library of high-performance linear algebra routines for distributed memory message-passing MIMD computers and networks of workstations supporting parallel virtual machine (PVM) and/or message passing interface (MPI). It is a continuation of the LAPACK project, which designed and produced analogous software for workstations, vector supercomputers, and shared memory parallel computers. Both libraries contain routines for solving systems of linear equations, least squares problems, and eigenvalue problems. The goals of both projects are efficiency, scalability, reliability, portability, flexibility, and ease of use.\parScaLAPACK includes routines for the solution of dense, band, and tridiagonal linear systems of equations, condition estimation and iterative refinement, for LU and Cholesky factorization, matrix inversion, full-rank linear least squares problems, orthogonal and generalized orthogonal factorizations, orthogonal transformation routines, reductions to upper Hessenberg, bidiagonal and tridiagonal form, reduction of a symmetric-definite/Hermitian-definite generalized eigenproblem to standard form, the symmetric/Hermitian, generalized symmetric/Hermitian, and the nonsymmetric eigenproblem. Prototype codes are provided for out-of-core solvers for LU, Cholesky, and QR, the matrix sign function for eigenproblems, and an HPF interface to a subset of ScaLAPACK routines.\parSoftware is available in single precision real, double precision real, single precision complex, and double precision complex. The software has been written to be portable across a wide range of distributed-memory environments such as the Cray T3, IBM SP, Intel series, TM CM-5, clusters of workstations, and any system for which PVM or MPI is available.\parEach Users’ Guide includes a CD-ROM containing the HTML version of the ScaLAPACK Users’ Guide, the source code for the package, testing and timing programs, prebuilt version of the library for a number of computers, example programs, and the full set of LAPACK Working Notes.


References in zbMATH (referenced in 322 articles , 3 standard articles )

Showing results 1 to 20 of 322.
Sorted by year (citations)

1 2 3 ... 15 16 17 next

  1. Hadjiantoni, Stella; Kontoghiorghes, Erricos John: Estimating large-scale general linear and seemingly unrelated regressions models after deleting observations (2017)
  2. Stoykov, S.; Margenov, S.: Numerical methods and parallel algorithms for computation of periodic responses of plates (2017)
  3. Beliakov, Gleb; Matiyasevich, Yuri: A parallel algorithm for calculation of determinants and minors using arbitrary precision arithmetic (2016)
  4. Drmač, Zlatko; Gugercin, Serkan: A new selection operator for the discrete empirical interpolation method -- improved a priori error bound and extensions (2016)
  5. Houska, Boris; Frasch, Janick; Diehl, Moritz: An augmented Lagrangian based algorithm for distributed nonconvex optimization (2016)
  6. Lācis, Uǧis; Taira, Kunihiko; Bagheri, Shervin: A stable fluid-structure-interaction solver for low-density rigid bodies using the immersed boundary projection method (2016)
  7. Liu, Xiao; Xia, Jianlin; de Hoop, Maarten V.: Parallel randomized and matrix-free direct solvers for large structured dense linear systems (2016)
  8. Meiyue Shao, Chao Yang: BSEPACK User’s Guide (2016) arXiv
  9. Michailidis, Panagiotis D.; Margaritis, Konstantinos G.: Scientific computations on multi-core systems using different programming frameworks (2016)
  10. Schatz, Martin D.; van de Geijn, Robert A.; Poulson, Jack: Parallel matrix multiplication: a systematic journey (2016)
  11. Shao, Meiyue; da Jornada, Felipe H.; Yang, Chao; Deslippe, Jack; Louie, Steven G.: Structure preserving parallel algorithms for solving the Bethe-Salpeter eigenvalue problem (2016)
  12. Shevchenko, I.V.; Berloff, P.S.; Guerrero-López, D.; Roman, J.E.: On low-frequency variability of the midlatitude ocean gyres (2016)
  13. Stoykov, S.; Margenov, S.: Scalable parallel implementation of shooting method for large-scale dynamical systems. Application to bridge components (2016)
  14. Baboulin, M.; Dongarra, J.; Lacroix, R.: Computing least squares condition numbers on hybrid multicore/GPU systems (2015)
  15. Banerjee, Amartya S.; Elliott, Ryan S.; James, Richard D.: A spectral scheme for Kohn-Sham density functional theory of clusters (2015)
  16. Galizia, Antonella; D’Agostino, Daniele; Clematis, Andrea: An MPI-CUDA library for image processing on HPC architectures (2015)
  17. Ghosh, Debojyoti; Constantinescu, Emil M.; Brown, Jed: Efficient implementation of nonlinear compact schemes on massively parallel platforms (2015)
  18. Granat, Robert; Kågström, Bo; Kressner, Daniel; Shao, Meiyue: Algorithm 953: parallel library software for the multishift QR algorithm with aggressive early deflation (2015)
  19. Kolberg, Mariana; Bohlender, Gerd; Fernandes, Luiz Gustavo: An efficient approach to solve very large dense linear systems with verified computing on clusters. (2015)
  20. Van Zee, Field G.; van de Geijn, Robert A.: BLIS: a framework for rapidly instantiating BLAS functionality (2015)

1 2 3 ... 15 16 17 next