Implementation of local outlier factor #1825

Open
wants to merge 16 commits into
base: main

Hakdag97
Collaborator

Description

Adds an implementation of the local outlier factor (LOF), which is used for outlier classification. The memory bottleneck is the pairwise distance matrix. Memory consumption can be reduced significantly by keeping only the n smallest distances for each point, since only these are needed to compute the LOF. To this end, a new distance function, cdist_small, combines the functionality of cdist and topk: the n smallest distances are chosen dynamically during the computation, without ever evaluating the whole cdist distance matrix at once.
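To illustrate the idea behind cdist_small, here is a minimal sketch in plain Python (standard library only). It is not the code from this PR: the function name `cdist_small_sketch`, its parameters, and the `chunk_size` argument are hypothetical and chosen for illustration. It only shows how the n smallest distances per row can be tracked while the second point set is processed chunk by chunk, so that the full distance matrix is never stored.

```python
import heapq
import math


def cdist_small_sketch(X, Y, n_smallest, chunk_size=2):
    """Hypothetical illustration, not the PR's implementation.

    For every point in X, return the n_smallest Euclidean distances to the
    points in Y (ascending) together with the matching indices into Y.
    Y is processed in chunks, so at most n_smallest + chunk_size candidate
    distances per row are held at any time.
    """
    best = [[] for _ in X]  # per row: list of (distance, index into Y)
    for start in range(0, len(Y), chunk_size):
        chunk = Y[start:start + chunk_size]
        for i, x in enumerate(X):
            # Distances from x to the points of the current chunk only.
            candidates = [(math.dist(x, y), start + j) for j, y in enumerate(chunk)]
            # Merge with the running result and keep only the n_smallest pairs.
            best[i] = heapq.nsmallest(n_smallest, best[i] + candidates)
    distances = [[d for d, _ in row] for row in best]
    indices = [[j for _, j in row] for row in best]
    return distances, indices


points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (5.0, 5.0)]
distances, indices = cdist_small_sketch(points, points, n_smallest=2)
```

With this approach the working memory per row depends on n_smallest and chunk_size rather than on the total number of points, which is the trade-off the description refers to.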

Issue/s resolved: #1758

Changes proposed:

  • New implementation of lof
  • New implementation of distance matrix cdist_small with efficient memory consumption
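For readers unfamiliar with LOF, the following is a small sketch of the standard textbook definition in plain Python. It is not the implementation from this PR: the name `lof_sketch` is hypothetical, neighbors are found by brute force for brevity, and the points are assumed to be distinct enough that the reachability distances do not sum to zero. It shows that only the k smallest distances per point (and their indices) enter the score.

```python
import math


def lof_sketch(points, k):
    """Hypothetical illustration of the textbook LOF definition."""
    # k nearest neighbors of each point (excluding itself), as (distance, index).
    neighbors = []
    for i, p in enumerate(points):
        dists = sorted((math.dist(p, q), j) for j, q in enumerate(points) if j != i)
        neighbors.append(dists[:k])

    # k-distance: distance to the k-th nearest neighbor.
    k_dist = [nb[-1][0] for nb in neighbors]

    # Local reachability density: inverse of the mean reachability distance,
    # where reach_dist(p, o) = max(k_dist(o), d(p, o)).
    lrd = []
    for nb in neighbors:
        reach = [max(k_dist[j], d) for d, j in nb]
        lrd.append(len(reach) / sum(reach))

    # LOF: mean ratio of the neighbors' density to the point's own density.
    return [
        sum(lrd[j] for _, j in nb) / (len(nb) * lrd[i])
        for i, nb in enumerate(neighbors)
    ]


# Four points on a unit square plus one far-away point.
scores = lof_sketch([(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)], k=2)
```

Scores close to 1 indicate a point whose local density matches that of its neighbors; scores well above 1 indicate a likely outlier.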

Type of change

  • New feature

Does this change modify the behaviour of other functions? If so, which?

no

Contributor

Thank you for the PR!

Hakdag97 requested a review from mrfh92 March 14, 2025 14:03
ClaudiaComito modified the milestones: 1.5.2, 1.6 Mar 17, 2025
Projects
Status: Todo
Development

Successfully merging this pull request may close these issues.

Implementation of local outlier factor
2 participants