• Nektar++

  • Referenced in 65 articles [sw11964]
  • Exploiting batch processing on streaming architectures to solve 2D elliptic finite element problems: a hybridized ... local matrix generation stage coupled with the parallelization techniques developed for the linear system solvers ... candidate for implementation on streaming architectures such as modern graphical processing units (GPUs). We propose ... method amenable to the fine-grained parallelism of GPUs. We demonstrate that the HDG method...
  • GPU-ABiSort

  • Referenced in 4 articles [sw20473]
  • present a novel approach for parallel sorting on stream processing architectures. It is based ... sorting. For sorting n values utilizing p stream processor units, this approach achieves the optimal ... This is achieved by using efficient linear stream memory accesses (and by combining the optimal ... graphics hardware (GPUs). On GPUs, our optimal parallel sorting approach has shown to be remarkably...
  • Chromium

  • Referenced in 5 articles [sw14150]
  • stream filters can be arranged to create sort-first and sort-last parallel graphics architectures ... these stream filters can be extended programmatically, allowing the user to customize the stream transformations ... cluster. Because our stream processing mechanism is completely general, any cluster-parallel rendering algorithm ... describe other potential uses of this stream processing technology. By completely abstracting the underlying graphics...
  • RngStreams

  • Referenced in 3 articles [sw11401]
  • creating independent random number streams for simulations in parallel processing environments. Techniques are presented...
  • ParGAP

  • Referenced in 6 articles [sw04905]
  • implements “parallel streaming”, running distinct algorithms on the same problem, but under different processes...
  • GHC

  • Referenced in 43 articles [sw23765]
  • other formalisms including dataflow languages, Communicating Sequential Processes, and functional languages for multiprocessing. Except ... standard not only of parallel logic programming languages but of parallel programming languages. Moreover ... design stage of GHC. We showed that stream merging and distribution which are expected...
  • GraphBolt

  • Referenced in 1 article [sw32526]
  • efficient streaming graph processing system that provides Bulk Synchronous Parallel (BSP) guarantees. GraphBolt performs dependency...
  • BSGP

  • Referenced in 5 articles [sw08995]
  • minimum of extra information to describe parallel processing on GPUs. As a result, BSGP programs ... combines them using optimally allocated temporary streams. In our benchmark, BSGP programs achieve similar...
  • Sigma*

  • Referenced in 8 articles [sw21731]
  • class of stream filters---programs that process streams of data items---for which Sigma* converges ... useful for web sanitizer verification and stream programs compiler optimizations, as we show experimentally ... Sigma* can boost performance of stream programs by parallelized code generation...
  • ParMAC

  • Referenced in 0 articles [sw15226]
  • ParMAC. ParMAC works on a cluster of processing machines with a circular topology and alternates ... parallel using stochastic updates, and the other trains the coordinates in parallel. Only submodel parameters ... shuffling, load balancing, fault tolerance and streaming data processing. We study the convergence of ParMAC ... theoretical model of its runtime and parallel speedup. We develop ParMAC to learn binary autoencoders...
  • Caravela

  • Referenced in 3 articles [sw14152]
  • processing units to execute multiple flow-models, connected by means of input-output streams, using ... using it. Caravela platform establishes a distributed parallel computing environment through a virtual network using ... flow-model, consisting of input-output streams, constant parameters and program, can be executed ... data stream, etc. The paper focuses on an extension of the pipelined processing mechanism, directly...
  • CUDAMPF++

  • Referenced in 1 article [sw21326]
  • central processing units (CPUs) with the support of streaming SIMD extensions (SSE) instructions. Few acceleration ... this paper, we propose a five-tiered parallel framework, CUDAMPF++, to accelerate the most computationally ... segment Viterbi (MSV/SSV), on a single graphics processing unit (GPU). As an architecture-aware design...
  • StreamFlex

  • Referenced in 0 articles [sw30244]
  • stream programming paradigm aims to expose coarse-grained parallelism inapplications that must process continuous sequences ... determined by the state of their inputchannels, stream programs have fewer opportunities for the errors...
  • MPtostream

  • Referenced in 3 articles [sw14131]
  • existing scientific computing applications to the heterogeneous parallel systems remains a big challenge. The OpenMP ... extended OpenMP for AMD’s stream processing GPUs. Our experimental results show that programming with...
  • Horizon

  • Referenced in 1 article [sw20671]
  • Horizon - exploiting timing information for parallel network simulation. Network simulation faces an increasing demand ... detailed models includes both accurate estimations of processing time and in-depth modeling of wireless ... meeting the deadlines of a multi-media streaming application such as VoIP...
  • samoa2

  • Referenced in 4 articles [sw25252]
  • memory efficient. Based purely on stack and stream data structures, it completely avoids random memory ... features hybrid MPI+OpenMP parallelization based on the Sierpinski order induced on the elements. Sections ... migration of grid cells between MPI processes. Using optimized communication and load balancing algorithms...
  • RMerge2

  • Referenced in 1 article [sw25236]
  • vector-matrix products and computes these by parallel row merging. It merges ... their number of nonzeros and are processed with specific kernels implemented using C++ templates. This ... hidden by concurrent kernel execution based on streams. Performance measurements show that merging more than ... respectively. Compared to a parallel CPU implementation, RMerge2 achieves a mean speedup...
  • CASTREAM

  • Referenced in 2 articles [sw15302]
  • combiners, it combines 128-bit blocks using parallel evolution of Cellular Automata (CA) and small ... boxes prevent the algebraic attacks. The stream cipher takes 128 bits Key and 128 bits ... nearly 8 bits per cycle. The initialization process needs 96 cycles which is much faster...
  • Gadgetron

  • Referenced in 1 article [sw30564]
  • data to reconstructed images. The data processing pipeline is configured dynamically at run‐time based ... rapid prototyping. In addition to the streaming framework infrastructure, the Gadgetron comes with ... reconstruction. This includes generic toolboxes for data‐parallel (e.g., GPU‐based) execution of compute‐intensive...
  • Jug

  • Referenced in 2 articles [sw27402]
  • Software for Parallel Reproducible Computation in Python. As computational pipelines become a bigger part ... able to even completely describe the process that led to them. For large projects ... necessary to save intermediate results while down-stream analyses are designed (or re-designed...