GotoBLAS2 was released by the Texas Advanced Computing Center as open source software under the BSD license. This product is no longer under active development by TACC, but it is available to the community to use, study, and extend. GotoBLAS2 uses new algorithms and memory techniques for optimal performance of the BLAS routines. The changes in this version target new architecture features in microprocessors and interprocessor communication techniques. In addition, NUMA controls enhance multi-threaded execution of BLAS routines on node.
Keywords for this software
References in zbMATH (referenced in 8 articles )
Showing results 1 to 8 of 8.
- Williams, Ryan: Faster all-pairs shortest paths via circuit complexity (2014)
- Castaldo, Anthony M.; Whaley, R.Clint; Samuel, Siju: Scaling LAPACK panel operations using parallel cache assignment (2013)
- Hedtke, Ivo; Murthy, Sandeep: Search and test algorithms for triple product property triples. (2012)
- Bodrato, Marco: A Strassen-like matrix multiplication suited for squaring and higher power computation (2010)
- Chowdhury, Rezaul Alam; Ramachandran, Vijaya: The cache-oblivious Gaussian elimination paradigm: Theoretical framework, parallelization and Experimental evaluation (2010)
- Van Dyk, Danny; Geveler, Markus; Mallach, Sven; Ribbrock, Dirk; Göddeke, Dominik; Gutwenger, Carsten: HONEI: A collection of libraries for numerical computations targeting multiple processor architectures (2009)
- González, Manuel; González, Francisco; Dopico, Daniel; Luaces, Alberto: On the effect of linear algebra implementations in real-time multibody system dynamics (2008)
- Goto, Kazushige; van de Geijn, Robert A.: Anatomy of high-performance matrix multiplication. (2008)