Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: balanced clustering #2999

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from
Draft

feat: balanced clustering #2999

wants to merge 1 commit into from

Conversation

jacketsj
Copy link
Contributor

@jacketsj jacketsj commented Oct 11, 2024

This implements FCFC from this paper.
Adds a parameter "balance_factor" to kmeans. Based on the paper's experimental results, 1.0 seems to be a fairly decent default value, and we could consider making it the default in the future.

This feature is restricted to the torch implementation for now. I think for it to work well, we'd actually have to incorporate it into the query path.

Later update: I'm actually not able to replicate the results from the paper at the moment, at least on the datasets I've tried.

This comment was marked as resolved.

@jacketsj jacketsj force-pushed the jack/balanced-clustering branch from b4d8738 to 9493865 Compare October 11, 2024 17:38
@jacketsj jacketsj changed the title Balanced Clustering feat: Balanced Clustering Oct 11, 2024
@github-actions github-actions bot added the enhancement New feature or request label Oct 11, 2024
@jacketsj jacketsj changed the title feat: Balanced Clustering feat: balanced clustering Oct 11, 2024
@jacketsj jacketsj force-pushed the jack/balanced-clustering branch from 9493865 to 83a30a4 Compare October 18, 2024 04:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request python
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant