• VTune

  • Referenced in 19 articles [sw08852]
  • data to tune CPU & GPU compute performance, multi-core scalability, bandwidth and more. Sort, filter...
  • CUBLAS

  • Referenced in 75 articles [sw06880]
  • access the computational resources of NVIDIA Graphics Processing Unit (GPU), but does not auto-parallelize...
  • OpenGL

  • Referenced in 131 articles [sw06740]
  • computer graphics. The API is typically used to interact with a Graphics processing unit (GPU...
  • GTEngine

  • Referenced in 38 articles [sw24041]
  • engine also supports high-performance computing using general purpose GPU programming (GPGPU). SIMD code...
  • gputools

  • Referenced in 6 articles [sw14139]
  • gputools package enables GPU computing in R. Motivation: By default, the R statistical environment does ... processing units (GPUs) provide an inexpensive and computationally powerful alternative. Using R and the CUDA ... microarray gene expression analysis for GPU-equipped computers. Results: R users can take advantage ... better performance provided by an Nvidia GPU. Availability: The package is available from CRAN...
  • GPUTeraSort

  • Referenced in 14 articles [sw12706]
  • memory-intensive and compute-intensive threads on the GPU. Our new sorting architecture provides multiple ... GPU along with the main memory interface for CPU computations. As a result, we achieve ... communication bandwidth between the CPU and the GPU, and reduces the data communication between...
  • GPGPU

  • Referenced in 20 articles [sw09105]
  • GPGPU: general-purpose computation on graphics hardware. The graphics processor (GPU) on today’s commodity ... graphics architectures provide tremendous memory bandwidth and computational horsepower, with dozens of fully programmable shading ... computation on graphics hardware. We emphasize core computational building blocks, ranging from linear algebra ... review the tools, perils, and strategies in GPU programming. We present analysis of GPU performance...
  • cuFFT

  • Referenced in 24 articles [sw11258]
  • simple interface for computing FFTs on an NVIDIA GPU, which allows users to quickly leverage...
  • DeCo

  • Referenced in 4 articles [sw24120]
  • computing and for graphical process unit (GPU) parallel computing. For the GPU implementation ... show how to use general purpose GPU computing almost effortlessly. This GPU implementation provides ... package and the computational gain of the GPU version through some simulation experiments and empirical...
  • hiCUDA

  • Referenced in 6 articles [sw12727]
  • different ways of identifying and extracting GPU computation, and of managing the GPU memory. Along...
  • Zippy

  • Referenced in 6 articles [sw34497]
  • Zippy: A Framework for Computation and Visualization on a GPU Cluster. Due to its high ... GPU cluster is an attractive platform for large scale general-purpose computation and visualization applications ... model for high performance general- purpose computation on GPU clusters remains a complex problem ... parallel visualiza- tion, graphics, and computation modules on a GPU cluster...
  • Nektar++

  • Referenced in 69 articles [sw11964]
  • This study provides comparison between CPU and GPU implementations of the method as well ... case study was dictated by the computationally-heavy local matrix generation stage as well ... method is well-suited for GPU implementation, obtaining total speedups on the order...
  • Theano

  • Referenced in 74 articles [sw05894]
  • integration with numpy, transparent use of a GPU, efficient symbolic differentiation, speed and stability optimizations ... verification. Theano has been powering large-scale computationally intensive scientific investigations since...
  • CHeart

  • Referenced in 5 articles [sw15301]
  • well as the leading edge of GPU computational technologies. CHeart is a multi-disciplinary effort...
  • StarPU

  • Referenced in 38 articles [sw14216]
  • computer mixing IBM Cell Broadband Engines and AMD opteron processors. Other architectures, featuring GPU accelerators...
  • Gunrock

  • Referenced in 4 articles [sw27063]
  • expressiveness by coupling high performance GPU computing primitives and optimization strategies with a high-level ... primitives with small code size and minimal GPU programming knowledge...
  • Jacket

  • Referenced in 4 articles [sw11529]
  • enables standard Matlab. It greatly simplifies GPU computing for engineers, scientists, and technical computing professionals ... Jacket, most Matlab users are using the GPU and obtaining significant performance gains...
  • MatConvNet

  • Referenced in 11 articles [sw15651]
  • same time, it supports efficient computation on CPU and GPU, allowing to train complex models...
  • CULA

  • Referenced in 11 articles [sw12745]
  • modern graphics processing unit (GPU) found in many standard personal computers is a highly parallel ... ratio. High-level linear algebra operations are computationally intense, often requiring O(N3) operations ... processing power of the GPU. Our work is on CULA, a GPU accelerated implementation...
  • SpGEMM

  • Referenced in 5 articles [sw14033]
  • well as with three GPU-based implementations. Measurements performed for computing the matrix square ... GPU caching architecture. An improved performance was also found for computing Galerkin products which...