Non-parametric Semi-supervised Learning by Bayesian Label Distribution Propagation

Jonatan Møller Nuutinen Gøttcke*, Arthur Zimek, Ricardo J.G.B. Campello

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

67 Downloads (Pure)

Abstract

Semi-supervised classification methods are specialized to use a very limited amount of labelled data for training and ultimately for assigning labels to the vast majority of unlabelled data. Label propagation is such a technique that assigns labels to those parts of unlabelled data that are in some sense close to labelled examples and then uses these predicted labels in turn to predict labels of more remote data. Here we propose to not propagate an immediate label decision to neighbors but to propagate the label probability distribution. This way we keep more information and take into account the remaining uncertainty of the classifier. We employ a Bayesian schema that is simpler and more straightforward than existing methods. As a consequence we avoid to propagate errors by decisions taken too early. A crisp decision can be derived from the propagated label distributions at will. We implement and test this strategy with a probabilistic k-nearest neighbor classifier, proving competitive with several state-of-the-art competitors in quality and more efficient in terms of computational resources.

Original languageEnglish
Title of host publicationSimilarity Search and Applications : 14th International Conference
EditorsNora Reyes, Richard Connor, Nils Kriege, Daniyal Kazempour, Ilaria Bartolini, Erich Schubert, Jian-Jia Chen
PublisherSpringer Science+Business Media
Publication date2021
Pages118-132
ISBN (Print)978-3-030-89656-0
ISBN (Electronic)978-3-030-89657-7
DOIs
Publication statusPublished - 2021
Event14th International Conference on Similarity Search and Applications, SISAP 2021 - Dortmund, Germany
Duration: 29. Sept 20211. Oct 2021

Conference

Conference14th International Conference on Similarity Search and Applications, SISAP 2021
Country/TerritoryGermany
CityDortmund
Period29/09/202101/10/2021
SeriesLecture Notes in Computer Science
Volume13058 LNCS
ISSN0302-9743

Bibliographical note

Publisher Copyright:
© 2021, Springer Nature Switzerland AG.

Keywords

  • k-Nearest neighbor classification
  • Label propagation
  • Semi-supervised classification
  • Transductive learning

Fingerprint

Dive into the research topics of 'Non-parametric Semi-supervised Learning by Bayesian Label Distribution Propagation'. Together they form a unique fingerprint.

Cite this