• Hadoop

  • Referenced in 113 articles [sw08481]
  • Apache Hadoop software library is a framework that allows for the distributed processing of large...
  • XGBoost

  • Referenced in 42 articles [sw21035]
  • same code runs on major distributed environments (Hadoop, SGE, MPI) and can solve problems beyond...
  • Spark

  • Referenced in 31 articles [sw23653]
  • partition is lost. Spark can outperform Hadoop by 10x in iterative machine learning jobs...
  • G-Hadoop

  • Referenced in 12 articles [sw08480]
  • security framework in G-Hadoop for big data computing across distributed cloud data centres. MapReduce ... large-scale data-intensive applications. The Hadoop framework is a well-known MapReduce implementation that ... MapReduce tasks on a cluster system. G-Hadoop is an extension of the Hadoop MapReduce ... multiple clusters. However, G-Hadoop simply reuses the user authentication and job submission mechanism...
  • KNIME

  • Referenced in 20 articles [sw06790]
  • available for distributed frameworks such as Hadoop. KNIME is used by over 3000 organizations...
  • CloudBurst

  • Referenced in 7 articles [sw12025]
  • consuming, but CloudBurst uses the open-source Hadoop implementation of MapReduce to parallelize execution using...
  • Twister

  • Referenced in 7 articles [sw27735]
  • Twister with other similar runtimes such as Hadoop and DryadLINQ for large scale data parallel...
  • RHadoop

  • Referenced in 4 articles [sw23467]
  • users to manage and analyze data with Hadoop. The packages have been tested (and always ... recent releases of the Cloudera and Hortonworks Hadoop distributions and should have broad compatibility with ... open source Hadoop and MapR’s distribution. We normally test on recent Revolution R/Microsoft...
  • HaLoop

  • Referenced in 3 articles [sw27955]
  • HaLoop is a modified version of the Hadoop MapReduce framework, designed to serve these applications ... reduces query runtimes by a factor of 1.85 compared with Hadoop, and shuffles only 4% of the data ... between mappers and reducers compared with Hadoop. In short, HaLoop has the following features ... users reuse major building blocks from applications’ Hadoop implementations, and 3) have similar intra...
  • HBase

  • Referenced in 6 articles [sw10948]
  • provides Bigtable-like capabilities on top of Hadoop and HDFS...
  • PEGASUS

  • Referenced in 6 articles [sw17479]
  • library, implemented on top of the Hadoop platform, the open source version of MapReduce...
  • BlobSeer

  • Referenced in 5 articles [sw10569]
  • integrate as a storage backend in the Hadoop MapReduce framework. We perform extensive microbenchmarks...
  • pmml

  • Referenced in 5 articles [sw13532]
  • Sybase IQ, Teradata and Teradata Aster) or Hadoop (Datameer and Hive...
  • Hadoop-BAM

  • Referenced in 2 articles [sw12021]
  • Hadoop-BAM: directly manipulating next generation sequencing data in the cloud. Summary: Hadoop ... aligned next-generation sequencing data in the Hadoop distributed computing framework. It acts ... files that are processed using Hadoop. Hadoop-BAM solves the issues related to BAM data ... this article we demonstrate the use of Hadoop-BAM by building a coverage summarizing tool...
  • hive

  • Referenced in 2 articles [sw24639]
  • package hive: Hadoop InteractiVE. Hadoop InteractiVE facilitates distributed computing via the MapReduce paradigm through ... Hadoop. An easy-to-use interface to Hadoop, the Hadoop Distributed File System (HDFS ... Hadoop Streaming is provided...
  • CloudGenius

  • Referenced in 4 articles [sw20154]
  • selection algorithm and the GA deployable on Hadoop clusters. Experiments with CumulusGenius give insights...
  • VC3

  • Referenced in 2 articles [sw23078]
  • their results. VC3 runs on unmodified Hadoop, but crucially keeps Hadoop, the operating system ... that VC3 performs well compared with unprotected Hadoop: VC3’s average runtime overhead is negligible...
  • HadoopStreaming

  • Referenced in 2 articles [sw23401]
  • HadoopStreaming: Utilities for using R scripts in Hadoop streaming. Provides a framework for writing map/reduce ... scripts for use in Hadoop Streaming. Also facilitates operating on data in a streaming fashion ... without Hadoop...
  • rmr2

  • Referenced in 2 articles [sw24637]
  • users to manage and analyze data with Hadoop. rmr2: A package that allows R developer ... perform statistical analysis in R via Hadoop MapReduce functionality on a Hadoop cluster. Install this...
  • rhdfs

  • Referenced in 2 articles [sw24636]
  • users to manage and analyze data with Hadoop. rhdfs: This package provides basic connectivity ... Hadoop Distributed File System. R programmers can browse, read, write, and modify files stored...