Bayesian Estimation Approaches for Local Intrinsic Dimensionality

Zaher Joukhadar*, Hanxun Huang, Sarah Monazam Erfani, Ricardo J. G. B. Campello, Michael E. Houle, James Bailey

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingBook chapterResearchpeer-review

Abstract

Local Intrinsic Dimensionality (LID) is a measure of data complexity in the vicinity of a query point. In this work, we address the problem of estimating LID from a Bayesian perspective by establishing a theoretical framework that derives the distribution of LID given a data sample. Using this framework, we develop new LID estimators that can outperform the Maximum Likelihood Estimator (MLE) in certain contexts. The framework also provides a convenient way to incorporate prior LID knowledge through informative priors. Additionally, we demonstrate how to aggregate multiple LID distributions in a Bayesian manner using logarithmic pooling. We conduct a variety of experiments, demonstrating that a Bayesian approach to LID is effective with a small number of nearest neighbors and when incorporating informative priors. We also show that in deep neural networks, MLE produces highly volatile LID estimates, whereas a Bayesian approach that incorporates prior LID information smoothes and reduces the variance of these estimates.
Original languageEnglish
Title of host publicationSimilarity Search and Applications
EditorsEdgar Chávez, Benjamin Kimia, Jakub Lokoč, Marco Patella, Jan Sedmidubsky
PublisherSpringer
Publication date25. Oct 2024
Pages111-125
ISBN (Print)978-3-031-75822-5
ISBN (Electronic)978-3-031-75823-2
DOIs
Publication statusPublished - 25. Oct 2024
Event17th International Conference of Similarity Search and Applications - Providence, United States
Duration: 4. Nov 20246. Nov 2024

Conference

Conference17th International Conference of Similarity Search and Applications
Country/TerritoryUnited States
CityProvidence
Period04/11/202406/11/2024
SeriesLecture Notes in Computer Science
Volume15268
ISSN0302-9743

Fingerprint

Dive into the research topics of 'Bayesian Estimation Approaches for Local Intrinsic Dimensionality'. Together they form a unique fingerprint.

Cite this