Abstract
In semi-supervised learning, a training sample comprises both labeled and unlabeled instances from each class under consideration. In practice, an important, yet challenging issue is the detection of novel classes that may be absent from the training sample. Here, we focus on the binary situation in which labeled instances come from the positive class, and unlabeled instances come from both classes. In particular, we propose a semi-supervised large-margin classifier to learn the negative (novel) class based on pseudo-data generated iteratively using an estimated model. Numerically, we employ an efficient algorithm to implement the proposed method using the hinge loss and ψ-loss functions. Theoretically, we derive a learning theory for the new classifier in order to quantify the misclassification error. Finally, a numerical analysis demonstrates that the proposed method compares favorably with its competitors on simulated examples, and is highly competitive on benchmark examples.
Original language | English (US) |
---|---|
Pages (from-to) | 961-982 |
Number of pages | 22 |
Journal | Statistica Sinica |
Volume | 32 |
Issue number | 2 |
DOIs | |
State | Published - Apr 2022 |
Bibliographical note
Publisher Copyright:© 2022 Institute of Statistical Science. All rights reserved.
Keywords
- Biased SVM
- PU learning
- iterative algorithm
- large-margins