• CUDA

  • Referenced in 1275 articles [sw03258]
  • CUDA Toolkit includes a compiler for NVIDIA GPUs, math libraries, and tools for debugging ... started quickly accelerating your application with GPUs...
  • TensorFlow

  • Referenced in 465 articles [sw15170]
  • computation to one or more CPUs or GPUs in a desktop, server, or mobile device...
  • Neural Network Toolbox

  • Referenced in 175 articles [sw07378]
  • distribute computations and data across multicore processors, GPUs, and computer clusters using Parallel Computing Toolbox...
  • Nektar++

  • Referenced in 85 articles [sw11964]
  • architectures such as modern graphical processing units (GPUs). We propose an algorithmic pipeline for mapping ... amenable to the fine-grained parallelism of GPUs. We demonstrate that the HDG method...
  • Firedrake

  • Referenced in 84 articles [sw14923]
  • PDEs and employ either conventional CPUs or GPUs to obtain the solution. Firedrake employs...
  • CUBLAS

  • Referenced in 80 articles [sw06880]
  • does not auto-parallelize across multiple GPUs. To use the CUBLAS library, the application must...
  • DualSPHysics

  • Referenced in 45 articles [sw17653]
  • parallel power computing of Graphics Computing Units (GPUs) is used to accelerate DualSPHysics...
  • cuFFT

  • Referenced in 24 articles [sw11258]
  • designed to provide high performance on NVIDIA GPUs. The cuFFTW library is provided ... users of FFTW to start using NVIDIA GPUs with a minimum amount of effort ... inputs and options efficiently on NVIDIA GPUs. This version of the cuFFT library supports...
  • Image Processing Toolbox

  • Referenced in 35 articles [sw13352]
  • registration. Many toolbox functions support multicore processors, GPUs, and C-code generation. Image Processing Toolbox...
  • MXNet

  • Referenced in 32 articles [sw20940]
  • portable and lightweight, scaling effectively to multiple GPUs and multiple machines. MXNet is also more...
  • Kokkos

  • Referenced in 24 articles [sw20455]
  • many-core CPUs), Cuda (for NVIDIA GPUs), and OpenMP (for Intel Phi). Note that...
  • OCTOPUS

  • Referenced in 21 articles [sw08525]
  • also has support for graphical processing units (GPUs) through OpenCL. Octopus is free software, released...
  • GPGPU

  • Referenced in 20 articles [sw09105]
  • GPGPU stands for ”General-Purpose Computation on GPUs”. GPGPU researchers have achieved over an order...
  • cuRAND

  • Referenced in 19 articles [sw11536]
  • hundreds of processor cores available in NVIDIA GPUs. cuRAND also provides two flexible interfaces, allowing...
  • BinaryConnect

  • Referenced in 18 articles [sw35871]
  • sets and large models. In the past, GPUs enabled these breakthroughs because of their greater...
  • CholQR

  • Referenced in 11 articles [sw13049]
  • case studies on multicore CPU with multiple gpus. To orthonormalize the columns of a dense ... problem, on a multicore CPU with multiple GPUs. These case studies demonstrate that by using...
  • QUDA

  • Referenced in 9 articles [sw14040]
  • QUDA: A library for QCD on GPUs. QUDA is a library for performing calculations ... lattice QCD on graphics processing units (GPUs), leveraging NVIDIA’s CUDA platform. The current release ... clover-field construction. Use of many GPUs in parallel is supported throughout, with communication handled...
  • yaSpMV

  • Referenced in 7 articles [sw17482]
  • yaSpMV: Yet another SpMV framework on GPUs. SpMV is a key linear algebra algorithm ... have been made to optimize SpMV on GPUs to leverage their massive computational throughput. Although ... hardware platforms. Our experimental results on GTX680 GPUs and GTX480 GPUs show that our proposed ... average on GTX680 GPUs, up to 150% and 42% on average on GTX480 GPUs...
  • GKLEE

  • Referenced in 10 articles [sw12794]
  • GKLEE: concolic verification and test generation for GPUs. Programs written for GPUs often contain correctness...
  • Sailfish

  • Referenced in 10 articles [sw16828]
  • method (LBM) on modern Graphics Processing Units (GPUs) using CUDA/OpenCL. We take a novel approach ... principles of the code, scaling to multiple GPUs in a distributed environment, as well...