• Apache Spark

  • Referenced in 63 articles [sw28418]
  • Apache Spark: Spark is a fast and general cluster computing system for Big Data...
  • MLlib

  • Referenced in 25 articles [sw15430]
  • MLlib: machine learning in apache spark. Apache Spark is a popular open-source platform...
  • MLbase

  • Referenced in 7 articles [sw15433]
  • testbed for MLlib. MLlib: Apache Spark’s distributed ML library. MLlib was initially developed...
  • AMIDST

  • Referenced in 5 articles [sw21741]
  • modern big data processing tools like Apache Spark or Apache Flink, which efficiently support iterative...
  • sparklyr

  • Referenced in 3 articles [sw19334]
  • package sparklyr: R Interface to Apache Spark. R interface to Apache Spark, a fast ... supports connecting to local and remote Apache Spark clusters, provides a ’dplyr’ compatible back...
  • ABCpy

  • Referenced in 5 articles [sw29748]
  • single computer or using an Apache Spark or MPI enabled cluster. The modularity helps domain...
  • SparkSW

  • Referenced in 2 articles [sw22017]
  • poses significant computational challenges. Apache Spark is an increasingly popular fast big data analytics engine ... that implements the SW algorithm on Apache Spark based distributed computing framework, with a couple ... success of SparkSW also reveals that Apache Spark framework provides an efficient solution to facilitate...
  • SparkR

  • Referenced in 3 articles [sw25055]
  • light-weight frontend to use Apache Spark from R. SparkR exposes the Spark API through...
  • PyODDS

  • Referenced in 3 articles [sw38163]
  • executions based on an Apache Spark backend server and a light-weight database. It also...
  • NScaleSpark

  • Referenced in 1 article [sw41762]
  • NScaleSpark: subgraph-centric graph analytics on Apache Spark. In this paper, we describe NScaleSpark ... distributed graph analysis tasks on the Apache Spark platform. NScaleSpark is motivated by the increasing ... GraphX (built on top of Apache Spark). However, the TLV paradigm is not suitable ... reimplemented NScale on the Apache Spark platform, the key challenges therein, and the design decisions...
  • Deeplearning4j

  • Referenced in 2 articles [sw27211]
  • latest distributed computing frameworks including Apache Spark and Hadoop to accelerate training. On multi-GPUs ... performance. The libraries are completely open-source, Apache 2.0, and maintained by the developer community...
  • SparkRnS

  • Referenced in 1 article [sw23652]
  • selection using Spark. This is an Apache Spark implementation of the GSP procedure for solving...
  • CRoaring

  • Referenced in 1 article [sw21564]
  • Several important systems such as Elasticsearch, Apache Spark, Netflix’s Atlas, LinkedIn’s Pivot, Metamarkets...
  • PySpark

  • Referenced in 1 article [sw40697]
  • PySpark is an interface for Apache Spark in Python. It not only allows...
  • GraphFrames

  • Referenced in 1 article [sw32536]
  • DataFrame-based graphs on top of Apache Spark. Users can write highly expressive queries...
  • BigDebug

  • Referenced in 1 article [sw27739]
  • primitives for interactive big data processing in spark. Developers use cloud computing platforms to process ... primitives for big data processing in Apache Spark, the next generation data-intensive scalable cloud...
  • SparkSeq

  • Referenced in 1 article [sw34381]
  • advantage of a new MapReduce framework, Apache Spark, for next-generation sequencing data. SparkSeq ... Availability and implementation: Available under open source Apache 2.0 license: https://bitbucket.org/mwiewiorka/sparkseq/...
  • spark.sas7bdat

  • Referenced in 0 articles [sw17774]
  • Data (’.sas7bdat’ Files) into ’Apache Spark’ from R. ’Apache Spark’ is an open source cluster...
  • spark-crowd

  • Referenced in 0 articles [sw28417]
  • paper, we present spark-crowd, an Apache Spark package for learning from crowdsourced data with...