Combining Self-Supervision and Privileged Information for Representation Learning from Tabular Data

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When building predictive models for real-world applications, many data are discarded because conventional learning algorithms cannot utilize it, although such data could be very informative. This paper focuses on representation learning using two types of additional data: privileged information (PI) and unlabeled data. PI refers to data available only during training but not at test time. Existing methods transfer the knowledge embedded in PI via supervised mechanisms, making them unable to use unlabeled data. In contrast, self-supervised learning methods can use unlabeled data but cannot learn from PI. While these techniques appear complementary, as we demonstrate, combining them is non-trivial. This paper introduces the Privileged Information Regularized (PIReg) self-supervised learning framework, which utilizes both PI and unlabeled data to learn better representations.

Original languageEnglish (US)
Title of host publicationProceedings - 24th IEEE International Conference on Data Mining, ICDM 2024
EditorsElena Baralis, Kun Zhang, Ernesto Damiani, Merouane Debbah, Panos Kalnis, Xindong Wu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages550-559
Number of pages10
ISBN (Electronic)9798331506681
DOIs
StatePublished - 2024
Event24th IEEE International Conference on Data Mining, ICDM 2024 - Abu Dhabi, United Arab Emirates
Duration: Dec 9 2024Dec 12 2024

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference24th IEEE International Conference on Data Mining, ICDM 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period12/9/2412/12/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • Healthcare
  • Privileged Information
  • Representation Learning
  • Self-Supervised Learning

Fingerprint

Dive into the research topics of 'Combining Self-Supervision and Privileged Information for Representation Learning from Tabular Data'. Together they form a unique fingerprint.

Cite this