• Cd-hit

  • Referenced in 46 articles [sw16887]
  • compares two protein datasets and reports similar matches between them; cd-hit-est clusters ... compares two nucleotide datasets. All these programs can handle huge datasets with millions of sequences...
  • BioGRID

  • Referenced in 46 articles [sw17422]
  • BioGRID: A general repository for interaction datasets. Access to unified datasets of protein and genetic ... interaction data. Full or user-defined datasets are freely downloadable as tab-delimited text files...
  • WebGraph

  • Referenced in 45 articles [sw30097]
  • provide a high compression ratio (see our datasets). The algorithms are controlled by several parameters ... graph, so to experiment with various settings. Datasets for very large graph (e.g., a billion ... files and downloading a dataset. This makes studying phenomena such as PageRank, distribution of graph...
  • KONECT

  • Referenced in 37 articles [sw17480]
  • Collection (KONECT), a project to collect network datasets in the areas of web science, network ... last decades many such datasets are now openly available. The KONECT project thus ... goal of collecting many diverse network datasets from the Web, and providing ... collection of over 160 network datasets, consisting of directed, undirected, unipartite, bipartite, weighted, unweighted, signed...
  • Caltech-256

  • Referenced in 50 articles [sw30598]
  • Caltech-256 Object Category Dataset. We introduce a challenging set of 256 object categories containing ... measure classification performance, then benchmark the dataset using two simple metrics as well...
  • PrivateLR

  • Referenced in 69 articles [sw11354]
  • epsilon for any pair D, D’ of datasets that differ in exactly one element...
  • SSVM

  • Referenced in 60 articles [sw12678]
  • algorithm. On six publicly available datasets, tenfold cross validation correctness of SSVM was the highest...
  • SDaA

  • Referenced in 59 articles [sw11941]
  • SDaA: Sampling: Design and Analysis. Functions and Datasets from Lohr, S. (1999), Sampling: Design...
  • Eigentaste

  • Referenced in 59 articles [sw12451]
  • when predictions are random. On the Jester dataset, Eigentaste computes recommendations two orders of magnitude...
  • ARACNE

  • Referenced in 34 articles [sw17200]
  • regulatory networks using both a realistic synthetic dataset and a microarray dataset from human ... cells. On synthetic datasets ARACNE achieves very low error rates and outperforms established methods, such...
  • Memtype-2L

  • Referenced in 39 articles [sw16472]
  • MemType-2L on a new-constructed stringent dataset by both the jackknife test ... independent dataset test are quite high, indicating that MemType-2L may become a very useful...
  • iSNO-PseAAC

  • Referenced in 39 articles [sw22446]
  • algorithm. As a demonstration, a benchmark dataset was constructed that contains 731 SNO sites ... identifying nitrosylated proteins on an independent dataset was over 90%, indicating that the new predictor...
  • GOLEM

  • Referenced in 52 articles [sw24695]
  • mesh design. GOLEM copes efficiently with large datasets. It achieves this efficiency because it avoids...
  • iLoc-Hum

  • Referenced in 36 articles [sw22433]
  • performed with iLoc-Hum on a benchmark dataset of human proteins that covers the following ... comparisons were also made via two independent datasets; all indicated that the success rates...
  • boot

  • Referenced in 49 articles [sw04518]
  • Angelo Canty for S) , functions and datasets for bootstrapping from the book ”Bootstrap Methods...
  • logcondens

  • Referenced in 49 articles [sw11215]
  • density at a fixed point. Finally, three datasets that have been used to illustrate...
  • SLIQ

  • Referenced in 49 articles [sw11759]
  • strategy to enable classification of disk-resident datasets. SLIQ also uses a new tree-pruning...
  • CAPUSHE

  • Referenced in 49 articles [sw13365]
  • additional application, the CAPUSHE package and the datasets presented in this paper, are available...
  • Daisy

  • Referenced in 34 articles [sw08597]
  • papers, are not reproducible. In many cases, datasets and time series, that are used ... called DAISY , to which authors can submit datasets that are used to illustrate certain claims...
  • Spark

  • Referenced in 34 articles [sw23653]
  • Spark introduces an abstraction called resilient distributed datasets (RDDs). An RDD is a read-only ... used to interactively query a 39 GB dataset with sub-second response time...