Probability estimation for large-margin classifiers

Junhui Wang, Xiaotong Shen, Yufeng Liu

Research output: Contribution to journalArticlepeer-review

60 Scopus citations

Abstract

Large margin classifiers have proven to be effective in delivering high predictive accuracy, particularly those focusing on the decision boundaries and bypassing the requirement of estimating the class probability given input for discrimination. As a result, these classifiers may not directly yield an estimated class probability, which is of interest itself. To overcome this difficulty, this article proposes a novel method for estimating the class probability through sequential classifications, by using features of interval estimation of large-margin classifiers. The method uses sequential classifications to bracket the class probability to yield an estimate up to the desired level of accuracy. The method is implemented for support vector machines and ψ-learning, in addition to an estimated Kullback-Leibler loss for tuning. A solution path of the method is derived for support vector machines to reduce further its computational cost. Theoretical and numerical analyses indicate that the method is highly competitive against alternatives, especially when the dimension of the input greatly exceeds the sample size. Finally, an application to leukaemia data is described.

Original languageEnglish (US)
Pages (from-to)149-167
Number of pages19
JournalBiometrika
Volume95
Issue number1
DOIs
StatePublished - Mar 2008

Bibliographical note

Funding Information:
This research is supported in part by grants from the U. S. National Science Foundation. The authors would like to thank Ji Zhu for helpful discussions, and thank the editor and three reviewers for comments and suggestions.

Keywords

  • Function estimation
  • High dimension and low sample size
  • Interval estimate
  • Tuning
  • Weighting

Fingerprint

Dive into the research topics of 'Probability estimation for large-margin classifiers'. Together they form a unique fingerprint.

Cite this