UCI++: Improved Support for Algorithm Selection Using Datasetoids. As companies employ a larger number of models, the problem of algorithm (and parameter) selection is becoming increasingly important. Two approaches to obtain empirical knowledge that is useful for that purpose are empirical studies and metalearning. However, most empirical (meta)knowledge is obtained from a relatively small set of datasets. In this paper, we propose a method to obtain a large number of datasets which is based on a simple transformation of existing datasets, referred to as datasetoids. We test our approach on the problem of using metalearning to predict when to prune decision trees. The results show significant improvement when using datasetoids. Additionally, we identify a number of potential anomalies in the generated datasetoids and propose methods to solve them

References in zbMATH (referenced in 3 articles , 1 standard article )

Showing results 1 to 3 of 3.
Sorted by year (citations)

  1. Muñoz, Mario A.; Villanova, Laura; Baatar, Davaatseren; Smith-Miles, Kate: Instance spaces for machine learning classification (2018)
  2. PrudĂȘncio, Ricardo B. C.; Ludermir, Teresa B.: Combining uncertainty sampling methods for supporting the generation of meta-examples (2012) ioport
  3. Soares, Carlos: UCI++: Improved support for algorithm selection using datasetoids (2009) ioport