• VTune

  • Referenced in 18 articles [sw08852]
  • data to tune CPU & GPU compute performance, multi-core scalability, bandwidth and more. Sort, filter...
  • OpenGL

  • Referenced in 124 articles [sw06740]
  • computer graphics. The API is typically used to interact with a Graphics processing unit (GPU...
  • CUBLAS

  • Referenced in 59 articles [sw06880]
  • access the computational resources of NVIDIA Graphics Processing Unit (GPU), but does not auto-parallelize...
  • GTEngine

  • Referenced in 31 articles [sw24041]
  • engine also supports high-performance computing using general purpose GPU programming (GPGPU). SIMD code...
  • GPUTeraSort

  • Referenced in 14 articles [sw12706]
  • memory-intensive and compute-intensive threads on the GPU. Our new sorting architecture provides multiple ... GPU along with the main memory interface for CPU computations. As a result, we achieve ... communication bandwidth between the CPU and the GPU, and reduces the data communication between...
  • GPGPU

  • Referenced in 20 articles [sw09105]
  • GPGPU: general-purpose computation on graphics hardware. The graphics processor (GPU) on today’s commodity ... graphics architectures provide tremendous memory bandwidth and computational horsepower, with dozens of fully programmable shading ... computation on graphics hardware. We emphasize core computational building blocks, ranging from linear algebra ... review the tools, perils, and strategies in GPU programming. We present analysis of GPU performance...
  • hiCUDA

  • Referenced in 6 articles [sw12727]
  • different ways of identifying and extracting GPU computation, and of managing the GPU memory. Along...
  • cuFFT

  • Referenced in 21 articles [sw11258]
  • simple interface for computing FFTs on an NVIDIA GPU, which allows users to quickly leverage...
  • gputools

  • Referenced in 4 articles [sw14139]
  • gputools package enables GPU computing in R. Motivation: By default, the R statistical environment does ... processing units (GPUs) provide an inexpensive and computationally powerful alternative. Using R and the CUDA ... microarray gene expression analysis for GPU-equipped computers. Results: R users can take advantage ... better performance provided by an Nvidia GPU. Availability: The package is available from CRAN...
  • Jacket

  • Referenced in 4 articles [sw11529]
  • enables standard Matlab. It greatly simplifies GPU computing for engineers, scientists, and technical computing professionals ... Jacket, most Matlab users are using the GPU and obtaining significant performance gains...
  • StarPU

  • Referenced in 32 articles [sw14216]
  • computer mixing IBM Cell Broadband Engines and AMD opteron processors. Other architectures, featuring GPU accelerators...
  • Nektar++

  • Referenced in 43 articles [sw11964]
  • This study provides comparison between CPU and GPU implementations of the method as well ... case study was dictated by the computationally-heavy local matrix generation stage as well ... method is well-suited for GPU implementation, obtaining total speedups on the order...
  • SpGEMM

  • Referenced in 5 articles [sw14033]
  • well as with three GPU-based implementations. Measurements performed for computing the matrix square ... GPU caching architecture. An improved performance was also found for computing Galerkin products which...
  • BSGP

  • Referenced in 5 articles [sw08995]
  • programming language for general purpose computation on the GPU. A BSGP program looks much...
  • CULA

  • Referenced in 9 articles [sw12745]
  • modern graphics processing unit (GPU) found in many standard personal computers is a highly parallel ... ratio. High-level linear algebra operations are computationally intense, often requiring O(N3) operations ... processing power of the GPU. Our work is on CULA, a GPU accelerated implementation...
  • Theano

  • Referenced in 39 articles [sw05894]
  • integration with numpy, transparent use of a GPU, efficient symbolic differentiation, speed and stability optimizations ... verification. Theano has been powering large-scale computationally intensive scientific investigations since...
  • DeCo

  • Referenced in 1 article [sw24120]
  • computing and for graphical process unit (GPU) parallel computing. For the GPU implementation ... show how to use general purpose GPU computing almost effortlessly. This GPU implementation provides ... package and the computational gain of the GPU version through some simulation experiments and empirical...
  • EAGL

  • Referenced in 2 articles [sw08231]
  • EAGL), a self-contained GPU library, to support parallel computing of bilinear pairings based ... GPU pipeline vs. memory access latency are highly complex for parallelization of pairing computations. Overall ... main performance bottleneck for pairing computations on the tested GPU device, and the lazy reduction ... offer substantial performance improvement for GPU-based pairing computations...
  • GPU Quicksort

  • Referenced in 8 articles [sw12707]
  • graphics processors. In this article, we describe GPU-quicksort, an efficient quicksort algorithm suitable ... programing platform for general-purpose computations on graphical processors, GPU-quicksort performs better than...
  • Gunrock

  • Referenced in 2 articles [sw27063]
  • expressiveness by coupling high performance GPU computing primitives and optimization strategies with a high-level ... primitives with small code size and minimal GPU programming knowledge...