COFFIN: Computational Framework for Linear SVMs In a variety of applications, kernel machines such as Support Vector Machines (SVMs) have been used with great success often delivering state-of-the-art results. Using the kernel trick, they work on several domains and even enable heterogeneous data fusion by concatenating feature spaces or multiple kernel learning. Unfortunately, they are not suited for truly large-scale applications since they suffer from the curse of supporting vectors, e.g., the speed of applying SVMs decays linearly with the number of support vectors. In this paper we develop COFFIN --- a new training strategy for linear SVMs that effectively allows the use of on demand computed kernel feature spaces and virtual examples in the primal. With linear training and prediction effort this framework leverages SVM applications to truly large-scale problems: As an example, we train SVMs for human splice site recognition involving 50 million examples and sophisticated string kernels. Additionally, we learn an SVM based gender detector on 5 million examples on low-tech hardware and achieve beyond the state-of-the-art accuracies on both tasks.

References in zbMATH (referenced in 8 articles )

Showing results 1 to 8 of 8.
Sorted by year (citations)

  1. Yang, Tianbao; Zhang, Lijun; Lin, Qihang; Zhu, Shenghuo; Jin, Rong: High-dimensional model recovery from random sketched data by exploring intrinsic sparsity (2020)
  2. May, Avner; Garakani, Alireza Bagheri; Lu, Zhiyun; Guo, Dong; Liu, Kuan; Bellet, Aurélien; Fan, Linxi; Collins, Michael; Hsu, Daniel; Kingsbury, Brian; Picheny, Michael; Sha, Fei: Kernel approximation methods for speech recognition (2019)
  3. Mahajan, Dhruv; Agrawal, Nikunj; Keerthi, S. Sathiya; Sellamanickam, Sundararajan; Bottou, Léon: An efficient distributed learning algorithm based on effective local functional approximations (2018)
  4. Petitjean, François; Buntine, Wray; Webb, Geoffrey I.; Zaidi, Nayyar: Accurate parameter estimation for Bayesian network classifiers using hierarchical Dirichlet processes (2018)
  5. Mahajan, Dhruv; Keerthi, S. Sathiya; Sundararajan, S.: A distributed block coordinate descent method for training (l_1) regularized linear classifiers (2017)
  6. Antoniuk, Kostiantyn; Franc, Vojtěch; Hlaváč, Václav: V-shaped interval insensitive loss for ordinal classification (2016)
  7. Martínez, Ana M.; Webb, Geoffrey I.; Chen, Shenglei; Zaidi, Nayyar A.: Scalable learning of Bayesian network classifiers (2016)
  8. Zaidi, Nayyar A.; Webb, Geoffrey I.; Carman, Mark J.; Petitjean, François; Cerquides, Jesús: (\textALR^n): accelerated higher-order logistic regression (2016)

Further publications can be found at: