• TheLMA

  • Referenced in 9 articles [sw12960]
  • physical standpoint. Our program is based on CUDA and uses POSIX threads to manage multiple...
  • CUDPP

  • Referenced in 6 articles [sw12697]
  • CUDA Data Parallel Primitives Library. CUDPP is a library of data-parallel algorithm primitives such ... tables. CUDPP runs on processors that support CUDA...
  • Copperhead

  • Referenced in 5 articles [sw30955]
  • techniques with several examples targeting the CUDA platform for parallel programming on GPUs. Copperhead code ... times fewer lines of code than CUDA, and the compiler generates efficient code, yielding ... performance of hand-crafted, well optimized CUDA code...
  • BPAS

  • Referenced in 8 articles [sw08399]
  • which combines serial C code and multithreaded CUDA code. However, each of the BPAS library...
  • GPU Quicksort

  • Referenced in 8 articles [sw12707]
  • graphics processors, but we show that in CUDA, NVIDIA’s programing platform for general-purpose...
  • BioHEL

  • Referenced in 7 articles [sw08532]
  • novel meta-representation called AKLR and a CUDA-based evaluation process are used to speed...
  • QUDA

  • Referenced in 7 articles [sw14040]
  • graphics processing units (GPUs), leveraging NVIDIA’s CUDA platform. The current release includes optimized Dirac...
  • NTRUEncrypt

  • Referenced in 7 articles [sw14148]
  • first time on a GPU using the CUDA platform. As is shown, this operation lends...
  • CP2K

  • Referenced in 7 articles [sw15391]
  • combination of multi-threading, MPI, and CUDA. It is freely available under the GPL license...
  • CUDA-lite

  • Referenced in 3 articles [sw13619]
  • CUDA-Lite: Reducing GPU Programming Complexity. The computer industry has transitioned into multi-core ... many-core parallel systems. The CUDA programming environment from NVIDIA is an attempt to make ... programmer to maximize performance when using CUDA. One such burden is dealing with the complex ... better performed by automated tools. We present CUDA-lite, an enhancement to CUDA...
  • STOCHSIMGPU

  • Referenced in 6 articles [sw10711]
  • using the computational power of your NVIDIA CUDA GPU without changing your model code...
  • Nikola

  • Referenced in 6 articles [sw14049]
  • Haskell that compiles to GPUs via CUDA using a new set of type-directed techniques...
  • TTC

  • Referenced in 6 articles [sw15828]
  • Knights Corner as well as different CUDA-based GPUs such as NVIDIA’s Kepler...
  • Kokkos

  • Referenced in 6 articles [sw20455]
  • These are OpenMP (for many-core CPUs), Cuda (for NVIDIA GPUs), and OpenMP (for Intel...
  • LBHydra

  • Referenced in 6 articles [sw22028]
  • user to harness the power of CUDA-compliant nVIDIA graphics processing units (GPUs). These modules...
  • Halide

  • Referenced in 6 articles [sw22108]
  • Compiler targets include x86/SSE, ARM v7/NEON, CUDA, and OpenCL...
  • Jacket

  • Referenced in 4 articles [sw11529]
  • Acceleration Engine for Matlab. Jacket uses the CUDA technology from Nvidia to utilize commodity video ... professionals. Users do not need to learn CUDA, SIMD, HPC, and other complicated parallel programming...
  • CAMPARY

  • Referenced in 4 articles [sw15156]
  • CAMPARY: Cuda Multiple Precision Arithmetic Library and Applications. Many scientific computing applications demand massive numerical ... precision floating-point arithmetic library using the CUDA programming language for the NVidia GPU platform...
  • CuPy

  • Referenced in 4 articles [sw27021]
  • open-source matrix library accelerated with NVIDIA CUDA. It also uses CUDA-related libraries including...
  • Ocelot

  • Referenced in 3 articles [sw09713]
  • data parallel execution model used by NVIDIA CUDA applications onto diverse multithreaded platforms. Ocelot includes ... dynamic compiler is able to execute existing CUDA binaries without recompilation from source and supports ... against over 130 applications taken from the CUDA SDK, the UIUC Parboil benchmarks...