AN ITERATIVE ALGORITHM TO LEARN FROM POSITIVE AND UNLABELED EXAMPLES

Xin Liu, Qingle Zheng, Xiaotong Shen, Shaoli Wang

Research output: Contribution to journalArticlepeer-review

Abstract

In semi-supervised learning, a training sample comprises both labeled and unlabeled instances from each class under consideration. In practice, an important, yet challenging issue is the detection of novel classes that may be absent from the training sample. Here, we focus on the binary situation in which labeled instances come from the positive class, and unlabeled instances come from both classes. In particular, we propose a semi-supervised large-margin classifier to learn the negative (novel) class based on pseudo-data generated iteratively using an estimated model. Numerically, we employ an efficient algorithm to implement the proposed method using the hinge loss and ψ-loss functions. Theoretically, we derive a learning theory for the new classifier in order to quantify the misclassification error. Finally, a numerical analysis demonstrates that the proposed method compares favorably with its competitors on simulated examples, and is highly competitive on benchmark examples.

Original languageEnglish (US)
Pages (from-to)961-982
Number of pages22
JournalStatistica Sinica
Volume32
Issue number2
DOIs
StatePublished - Apr 2022

Bibliographical note

Publisher Copyright:
© 2022 Institute of Statistical Science. All rights reserved.

Keywords

  • Biased SVM
  • PU learning
  • iterative algorithm
  • large-margins

Fingerprint

Dive into the research topics of 'AN ITERATIVE ALGORITHM TO LEARN FROM POSITIVE AND UNLABELED EXAMPLES'. Together they form a unique fingerprint.

Cite this