• BayesDA

  • Referenced in 1060 articles [sw11008]
  • BayesDA: Functions and Datasets for the book ”Bayesian Data Analysis” Functions for Bayesian Data Analysis ... with datasets from the book ”Bayesian data Analysis (second edition)” by Gelman, Carlin, Stern ... Rubin. Not all datasets yet, hopefully completed soon...
  • longmemo

  • Referenced in 598 articles [sw11216]
  • Memory Processes (Jan Beran) – Data and Functions. Datasets and Functionality from the textbook Jan Beran...
  • MASS (R)

  • Referenced in 281 articles [sw04479]
  • package MASS: Support Functions and Datasets for Venables and Ripley’s MASS , Functions and datasets...
  • MapReduce

  • Referenced in 248 articles [sw00546]
  • calculation over extremely large datasets. The arrival of MapReduce provides a chance to utilize commodity...
  • ParaView

  • Referenced in 154 articles [sw06128]
  • ParaView was developed to analyze extremely large datasets using distributed memory computing resources ... supercomputers to analyze datasets of terascale as well as on laptops for smaller data...
  • SMOTE

  • Referenced in 125 articles [sw34239]
  • construction of classifiers from imbalanced datasets is described. A dataset is imbalanced if the classification...
  • BSDS

  • Referenced in 170 articles [sw14427]
  • Berkeley Segmentation Dataset and Benchmark. A database of human segmented natural images and its application...
  • DistAl

  • Referenced in 99 articles [sw01746]
  • datamining and knowledge acquisition from large datasets. The paper presents results of experiments using several ... artificial and real-world datasets. The results demonstrate that DistAl compares favorably with other learning...
  • CIFAR

  • Referenced in 66 articles [sw17861]
  • subsets of the 80 million tiny images dataset. They were collected by Alex Krizhevsky, Vinod ... Nair, and Geoffrey Hinton. The CIFAR-10 dataset consists of 60000 32x32 colour images ... training images and 10000 test images. The dataset is divided into five training batches ... images from each class. The CIFAR-100 dataset: This dataset is just like the CIFAR...
  • gSpan

  • Referenced in 108 articles [sw11908]
  • frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan...
  • RSVM

  • Referenced in 52 articles [sw15261]
  • little as 1% of a large dataset for its explicit evaluation. To generate this nonlinear ... surface, the entire dataset is used as a constraint in an optimization problem with very ... small randomly selected portion of the dataset, is better than that of a conventional support ... surface that explicitly depends on the entire dataset, and much better than a conventional...
  • Pegasos

  • Referenced in 93 articles [sw08752]
  • especially suited for learning from large datasets. Our approach also extends to non-linear kernels...
  • ImageNet

  • Referenced in 92 articles [sw21105]
  • ImageNet is an image dataset organized according to the WordNet hierarchy. Each meaningful concept...
  • FRK

  • Referenced in 90 articles [sw19172]
  • spatial/spatio-temporal modelling and prediction with large datasets. The approach, discussed in Cressie and Johannesson...
  • DBpedia

  • Referenced in 52 articles [sw27336]
  • allows you to ask sophisticated queries against datasets derived from Wikipedia and to link other ... datasets on the Web to Wikipedia data. We describe the extraction of the DBpedia datasets ... status of interlinking DBpedia with other open datasets on the Web and outline how DBpedia...
  • ROBPCA

  • Referenced in 63 articles [sw11592]
  • ROBPCA yields more accurate estimates at noncontaminated datasets and more robust estimates at contaminated data ... outliers. We apply the algorithm to several datasets from chemometrics and engineering...
  • LOF

  • Referenced in 82 articles [sw19311]
  • enjoys many desirable properties. Using real-world datasets, we demonstrate that LOF can be used...
  • Cd-hit

  • Referenced in 46 articles [sw16887]
  • compares two protein datasets and reports similar matches between them; cd-hit-est clusters ... compares two nucleotide datasets. All these programs can handle huge datasets with millions of sequences...
  • BioGRID

  • Referenced in 45 articles [sw17422]
  • BioGRID: A general repository for interaction datasets. Access to unified datasets of protein and genetic ... interaction data. Full or user-defined datasets are freely downloadable as tab-delimited text files...