Notes and links on SVMs (WIP)

What is an SVM?

Seems like a nice math trick to find the location of the median point, line, surface, or plane that separates data.

It can work with data that is not linearly separable in 2 dimensions but is in 3 (or more) dimensions.

All you need is another point of view

All you need is another point of view

The algebra

MIT Course - 50 mins

Another vid - 15 minutes

SVM kernals

SVM Kernels : Data Science Concepts

How to take a “short-cut” when calculating the products of the data transformed into a higher dimension. Keeps the complexity at n² rather than n*P (where P is the number of dimensions) + n².

  • linear
  • polynomial
  • radial basis function (RBF)

Visualisation with Scikit-learn

Knime

youtu.be/0V_in5gr1…

Foundational Support Vector Machine Publications

  1. A Training Algorithm for Optimal Margin Classifiers
    Bernhard E. Boser, Isabelle M. Guyon & Vladimir N. Vapnik (1992)
    https://dl.acm.org/doi/10.1145/130385.130401

  2. Support-Vector Networks
    Corinna Cortes & Vladimir Vapnik (1995)
    https://doi.org/10.1007/BF00994018

  3. The Nature of Statistical Learning Theory
    Vladimir N. Vapnik (1995)
    https://doi.org/10.1007/978-1-4757-3264-1

  4. A Tutorial on Support Vector Machines for Pattern Recognition
    Christopher J. C. Burges (1998)
    https://doi.org/10.1023/A:1009715923555

  5. Support Vector Regression Machines
    Harris Drucker, Christopher J. C. Burges, Linda Kaufman, Alex J. Smola & Vladimir Vapnik (1996)
    https://papers.nips.cc/paper/1238-support-vector-regression-machines

  6. Making Large-Scale SVM Learning Practical
    Thorsten Joachims (1999)
    https://www.cs.cornell.edu/~tj/publications/joachims_99a.pdf

  7. New Support Vector Algorithms
    Bernhard Schölkopf, Alex J. Smola, Robert C. Williamson & Peter L. Bartlett (2000)
    https://doi.org/10.1162/089976600300015565

Additional Foundational and Influential SVM Papers

  1. Support Vector Machines and Kernel Methods: The New Generation of Learning Machines
    Nello Cristianini & Bernhard Schölkopf (2002)
    A comprehensive overview of kernel methods and SVMs, highlighting their theoretical foundations and practical applications across various domains.
    Link

  2. The Entire Regularization Path for the Support Vector Machine
    Trevor Hastie, Saharon Rosset, Robert Tibshirani & Ji Zhu (2004)
    Introduces an algorithm to efficiently compute the entire solution path of SVMs as the regularization parameter varies, providing insights into model selection and regularization effects.
    Link

  3. Learning from Distributions via Support Measure Machines
    Krikamol Muandet, Kenji Fukumizu, Francesco Dinuzzo & Bernhard Schölkopf (2012)
    Extends SVMs to operate on probability distributions by embedding them into a reproducing kernel Hilbert space, enabling learning from complex data types.
    Link

  4. Learning Optimally Sparse Support Vector Machines
    Andrew Cotter, Shai Shalev-Shwartz & Nathan Srebro (2013)
    Proposes methods to train SVMs that achieve optimal sparsity, reducing the number of support vectors without compromising performance.
    Link

  5. Deep Learning using Linear Support Vector Machines
    Yichuan Tang (2013)
    Demonstrates that integrating linear SVMs into deep learning architectures can enhance classification performance compared to traditional softmax layers.
    Link

  6. Learning by Transduction
    Alex Gammerman, Volodya Vovk & Vladimir Vapnik (2013)
    Introduces a transductive learning approach based on SVMs, focusing on predicting specific test instances rather than generalizing across the entire input space.
    Link

  7. Support Vector Machines with Applications
    Javier M. Moguerza & Alberto Muñoz (2006)
    Provides a thorough introduction to SVMs, discussing their theoretical underpinnings and showcasing applications in various fields.
    Link

  8. Support Vector Method for Function Approximation, Regression Estimation, and Signal Processing
    Authors: Vladimir Vapnik, Steven E. Golowich, Alex J. Smola (1996)
    Overview: This paper extends SVMs to regression tasks, introducing the concept of Support Vector Regression (SVR), which has become a fundamental technique in machine learning.
    Link: Proceedings of NIPS 1996

  9. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods
    Authors: Nello Cristianini, John Shawe-Taylor (2000)
    Overview: A comprehensive textbook that provides an in-depth introduction to SVMs and kernel methods, suitable for both students and practitioners.
    Link: Cambridge University Press

  10. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
    Authors: Bernhard Schölkopf, Alexander J. Smola (2002)
    Overview: This book delves into the theoretical foundations of SVMs and kernel methods, offering insights into their practical applications.
    Link: MIT Press

  11. Support Vector Machines: Theory and Applications
    Authors: Theodoros Evgeniou, Massimiliano Pontil (2001)
    Overview: A collection discussing the theory behind SVMs and their diverse applications across various domains.
    Link: Springer

  12. A Comprehensive Survey on Support Vector Machine Classification
    Authors: K. M. M. Prabhu, R. K. Selvakumar (2020)
    Overview: This survey provides an overview of SVM classification techniques, applications, and future research directions.
    Link: ScienceDirect

  13. Methods for Class-Imbalanced Learning with Support Vector Machines: A Review and an Empirical Evaluation
    Authors: Salim Rezvani, Farhad Pourpanah, Chee Peng Lim, Q. M. Jonathan Wu (2024)
    Overview: Discusses approaches to handle class imbalance in SVMs, including resampling and algorithmic methods.
    Link: Springer

  14. Comprehensive Review on Twin Support Vector Machines
    Authors: M. Tanveer, T. Rajani, R. Rastogi, Y. H. Shao, M. A. Ganaie (2021)
    Overview: Explores Twin SVMs, which solve two smaller optimization problems, offering computational advantages.
    Link: arXiv

  15. High-Performance Support Vector Machines and Its Applications
    Authors: Taiping He, Tao Wang, Ralph Abbey, Joshua Griffin (2019)
    Overview: Proposes a distributed SVM algorithm suitable for large-scale data and demonstrates its applications.
    Link: arXiv