SLIQ: A fast scalable classifier for data mining. Classification is an important problem in the emerging field of data mining. Although classification has been studied extensively in the past, most of the classification algorithms are designed only for memory-resident data, thus limiting their suitability for data mining large data sets. This paper discusses issues in building a scalable classifier and presents the design of SLIQ, a new classifier. SLIQ is a decision tree classifier that can handle both numeric and categorical attributes. It uses a novel pre-sorting technique in the tree-growth phase. This sorting procedure is integrated with a breadth-first tree growing strategy to enable classification of disk-resident datasets. SLIQ also uses a new tree-pruning algorithm that is inexpensive, and results in compact and accurate trees. The combination of these techniques enables SLIQ to scale for large data sets and classify data sets irrespective of the number of classes, attributes, and examples (records), thus making it an attractive tool for data mining.

References in zbMATH (referenced in 50 articles )

Showing results 1 to 20 of 50.
Sorted by year (citations)

1 2 3 next

  1. Calzavara, Stefano; Lucchese, Claudio; Tolomei, Gabriele; Abebe, Seyum Assefa; Orlando, Salvatore: \textscTreant: training evasion-aware decision trees (2020)
  2. Tahmassebi, Amirhessam; Gandomi, Amir H.; Schulte, Mieke H. J.; Goudriaan, Anna E.; Foo, Simon Y.; Meyer-Baese, Anke: Optimized naive-Bayes and decision tree approaches for fMRI smoking cessation classification (2018)
  3. Altay, Ayca; Cinar, Didem: Fuzzy decision trees (2016)
  4. Hassani, Hossein; Huang, Xu; Silva, Emmanuel S.; Ghodsi, Mansi: A review of data mining applications in crime (2016)
  5. Rokach, Lior; Maimon, Oded: Data mining with decision trees. Theory and applications. (2015)
  6. Baralis, Elena; Cagliero, Luca; Cerquitelli, Tania; D’Elia, Vincenzo; Garza, Paolo: Expressive generalized itemsets (2014)
  7. Gama, João; Žliobaitė, Indrė; Bifet, Albert; Pechenizkiy, Mykola; Bouchachia, Abdelhamid: A survey on concept drift adaptation (2014)
  8. Khoshgoftaar, Taghi M.; Xiao, Yudong; Gao, Kehan: Software quality assessment using a multi-strategy classifier (2014) ioport
  9. Nasridinov, Aziz; Lee, Yangsun; Park, Young-Ho: Decision tree construction on GPU: ubiquitous parallel computing approach (2014) ioport
  10. Stojanova, Daniela; Ceci, Michelangelo; Appice, Annalisa; Džeroski, Sašo: Network regression with predictive clustering trees (2012)
  11. Salehi-Moghaddami, Nima; Yazdi, Hadi Sadoghi; Poostchi, Hanieh: Correlation based splitting criterionin multi branch decision tree (2011)
  12. Vreeken, Jilles; Van Leeuwen, Matthijs; Siebes, Arno: Krimp: mining itemsets that compress (2011)
  13. Bifet, Albert: Adaptive stream mining: Pattern learning and mining from evolving data streams. (2010)
  14. Chandra, B.; Kothari, Ravi; Paul, Pallath: A new node splitting measure for decision tree construction (2010)
  15. Chandra, B.; Varghese, P. Paul: Moving towards efficient decision tree construction (2009)
  16. Popova, E. A.: Method for parallel construction of a committee of decision tree for processing the electroencephalography signals (2009)
  17. Glimcher, Leonid; Jin, Ruoming; Agrawal, Gagan: Middleware for data mining applications on clusters and grids (2008) ioport
  18. Gonzáles-Aranda, P.; Menasalvas, E.; Millán, S.; Ruiz, Carlos; Segovia, J.: Towards a methodology for data mining project development: the importance of abstraction (2008)
  19. Hu, Hui-Ling; Chen, Yen-Liang: Mining typical patterns from databases (2008) ioport
  20. Castro, José; Secretan, Jimmy; Georgiopoulos, Michael; DeMara, Ronald; Anagnostopoulos, Georgios; Gonzalez, Avelino: Pipelining of Fuzzy ARTMAP without matchtracking: Correctness, performance bound, and Beowulf evaluation (2007)

1 2 3 next