• MapReduce

  • Referenced in 263 articles [sw00546]
  • MapReduce is a new parallel programming model initially developed for large-scale web content processing ... over extremely large datasets. The arrival of MapReduce provides a chance to utilize commodity hardware ... optimization from relational algebra operators to MapReduce programs is still an open and dynamic research ... first study the communication cost of the MapReduce model, then we give an initial implementation...
  • Spark

  • Referenced in 41 articles [sw23653]
  • Spark: cluster computing with working sets. MapReduce and its variants have been highly successful ... retaining the scalability and fault tolerance of MapReduce. To achieve these goals, Spark introduces...
  • GraphLab

  • Referenced in 24 articles [sw12830]
  • challenging. Existing high-level parallel abstractions like MapReduce are insufficiently expressive while low-level tools ... developed GraphLab, which improves upon abstractions like MapReduce by compactly expressing asynchronous iterative algorithms with...
  • GATK

  • Referenced in 17 articles [sw12019]
  • genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Next-generation ... sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich...
  • G-Hadoop

  • Referenced in 12 articles [sw08480]
  • data computing across distributed cloud data centres. MapReduce is regarded as an adequate programming model ... Hadoop framework is a well-known MapReduce implementation that runs the MapReduce tasks ... Hadoop is an extension of the Hadoop MapReduce framework with the functionality of allowing ... MapReduce tasks to run on multiple clusters. However, G-Hadoop simply reuses the user authentication...
  • Twister

  • Referenced in 6 articles [sw27735]
  • Twister: a runtime for iterative MapReduce. MapReduce programming model has simplified the implementation of many ... services provided by many implementations of MapReduce attract a lot of enthusiasm among distributed computing ... From the years of experience in applying MapReduce to various scientific applications we identified ... architecture that will expand the applicability of MapReduce to more classes of applications. In this...
  • WebPIE

  • Referenced in 8 articles [sw12482]
  • scale parallel inference engine using MapReduce. The large amount of Semantic Web data ... Horst semantics using the MapReduce programming model. We will show that a straightforward implementation...
  • CloudBurst

  • Referenced in 6 articles [sw12025]
  • CloudBurst: highly sensitive read mapping with MapReduce. Motivation: Next-generation DNA sequencing machines are generating ... uses the open-source Hadoop implementation of MapReduce to parallelize execution using multiple compute nodes ... model for parallelizing algorithms with MapReduce at http://cloudburst-bio.sourceforge.net...
  • Geppetto

  • Referenced in 10 articles [sw31791]
  • sharing state between computations (e.g, For MapReduce) or within a single computation...
  • PLANET

  • Referenced in 5 articles [sw15434]
  • Massively parallel learning of tree ensembles with mapreduce. Classification and regression tree learning on massive ... computations, and implements each one using the MapReduce model of distributed computation. We show ... benefits and challenges of using a MapReduce compute cluster for tree learning, and demonstrate...
  • MrsRF

  • Referenced in 4 articles [sw12018]
  • MrsRF: an efficient MapReduce algorithm for analyzing large collections of evolutionary trees. MapReduce ... paper, we evaluate the viability of the MapReduce framework for designing phylogenetic applications. The problem ... collections of evolutionary trees. We introduce MrsRF (MapReduce Speeds up RF), a multi-core algorithm ... distance matrix between t trees using the MapReduce paradigm...
  • PEGASUS

  • Referenced in 8 articles [sw17479]
  • Hadoop platform, the open source version of MapReduce. Many graph mining operations (PageRank, spectral clustering...
  • BlobSeer

  • Referenced in 5 articles [sw10569]
  • storage backend in the Hadoop MapReduce framework. We perform extensive microbenchmarks as well as experiments ... with real MapReduce applications: they demonstrate that applying the principles defended in our approach brings...
  • DryadLINQ

  • Referenced in 7 articles [sw23712]
  • generalizes previous execution environments such as SQL, MapReduce, and Dryad in two ways: by adopting...
  • LaraDB

  • Referenced in 4 articles [sw39113]
  • middleware algebra: more explicit than MapReduce but more general than ... LaraDB implementation outperforms Accumulo’s native MapReduce integration on a core task involving join...
  • VC3

  • Referenced in 3 articles [sw23078]
  • system that allows users to run distributed MapReduce computations in the cloud while keeping their ... deploy new protocols that secure distributed MapReduce computations. VC3 optionally enforces region self-integrity invariants ... MapReduce code running within isolated regions, to prevent attacks due to unsafe memory reads...
  • MR-DBSCAN

  • Referenced in 3 articles [sw30348]
  • Efficient Parallel Density-Based Clustering Algorithm Using MapReduce. Data clustering is an important data mining ... large scale in the real world. Meanwhile, MapReduce is a desirable parallel programming platform that ... implement it by a 4-stages MapReduce paradigm. Furthermore, we adopt a quick partitioning strategy...
  • Dremel

  • Referenced in 5 articles [sw13849]
  • Dremel, and explain how it complements MapReduce-based computing. We present a novel columnar storage...
  • HaLoop

  • Referenced in 3 articles [sw27955]
  • modified version of the Hadoop MapReduce framework, designed to serve these applications. HaLoop not only ... extends MapReduce with programming support for iterative applications, but also dramatically improves their efficiency...
  • Vispark

  • Referenced in 2 articles [sw17471]
  • data processing in diverse application domains, MapReduce (e.g., Hadoop) has become one of the standard ... cluster system. Despite its popularity, the current MapReduce framework suffers from inflexibility and inefficiency inherent ... novel extension of Spark for GPU-accelerated MapReduce processing on array-based scientific computing ... syntax and a novel data abstraction for MapReduce programming on a GPU cluster system. Vispark...