add row count for conditional_join #1269

Open
samukweku opened this issue Jun 14, 2023 · 1 comment · May be fixed by #1457
Labels
enhancement New feature or request

Comments

@samukweku
Collaborator

Brief Description

Add an option that returns, for each row of the left dataframe, the number of matching rows in the right dataframe.

Example API

(
    d2
    .conditional_join(
        d1,
        ('pos', 'segment.start', '>='),
        ('pos', 'segment.end', '<='),
        ('chr', 'chr', '=='),
        return_counts=True,
    )
)
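To make the intended result concrete, here is a minimal sketch of what the proposed `return_counts=True` might compute for the conditions above. It is a brute-force NumPy comparison, not how `conditional_join` is implemented; `count_matches` is a hypothetical helper and the sample frames are made up for illustration.

```python
import numpy as np
import pandas as pd


def count_matches(left: pd.DataFrame, right: pd.DataFrame) -> pd.Series:
    """Hypothetical helper: count, per row of `left`, the matching rows in `right`."""
    # Reshape the left columns to (n_left, 1) so they broadcast against
    # the right columns of shape (n_right,).
    pos = left["pos"].to_numpy()[:, None]
    chrom = left["chr"].to_numpy()[:, None]
    mask = (
        (pos >= right["segment.start"].to_numpy())
        & (pos <= right["segment.end"].to_numpy())
        & (chrom == right["chr"].to_numpy())
    )
    # Each row of the mask marks the right rows matched by one left row.
    return pd.Series(mask.sum(axis=1), index=left.index, name="counts")


# Made-up sample data; chromosomes are encoded as integers for simplicity.
d1 = pd.DataFrame(
    {"chr": [1, 1, 2], "segment.start": [1, 5, 1], "segment.end": [10, 20, 4]}
)
d2 = pd.DataFrame({"chr": [1, 1, 2, 2], "pos": [3, 7, 2, 9]})

counts = count_matches(d2, d1)
print(counts.tolist())  # [1, 2, 1, 0]
```

This builds an `n_left x n_right` boolean array, so it is only meant to pin down the semantics (including a count of 0 for unmatched rows), not to be efficient.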
@samukweku
Collaborator Author

We could possibly extend this idea to aggregations (sum, min, max, ...). That might be limited to numba for now, unless we figure out an efficient way to take advantage of pandas groupby without building a dataframe first 🤔
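As a rough illustration of the aggregation idea, the sketch below computes sum, min and max of a right-hand column over the matches of each left row, without materialising the joined dataframe. It uses a dense NumPy mask rather than numba or groupby, so it only shows the expected output shape. `aggregate_matches`, the `value` column and the sample data are all assumptions made for this example.

```python
import numpy as np
import pandas as pd


def aggregate_matches(
    left: pd.DataFrame, right: pd.DataFrame, value_col: str
) -> pd.DataFrame:
    """Hypothetical helper: aggregate `right[value_col]` over each left row's matches."""
    pos = left["pos"].to_numpy()[:, None]
    chrom = left["chr"].to_numpy()[:, None]
    mask = (
        (pos >= right["segment.start"].to_numpy())
        & (pos <= right["segment.end"].to_numpy())
        & (chrom == right["chr"].to_numpy())
    )
    values = right[value_col].to_numpy(dtype=float)
    has_match = mask.any(axis=1)
    # Fill non-matching cells with the identity of each reduction.
    total = np.where(mask, values, 0.0).sum(axis=1)
    smallest = np.where(mask, values, np.inf).min(axis=1)
    largest = np.where(mask, values, -np.inf).max(axis=1)
    # Left rows without any match get NaN instead of the fill value.
    return pd.DataFrame(
        {
            "sum": np.where(has_match, total, np.nan),
            "min": np.where(has_match, smallest, np.nan),
            "max": np.where(has_match, largest, np.nan),
        },
        index=left.index,
    )


# Made-up sample data; `value` is a hypothetical column to aggregate.
d1 = pd.DataFrame(
    {
        "chr": [1, 1, 2],
        "segment.start": [1, 5, 1],
        "segment.end": [10, 20, 4],
        "value": [10.0, 20.0, 30.0],
    }
)
d2 = pd.DataFrame({"chr": [1, 1, 2, 2], "pos": [3, 7, 2, 9]})

out = aggregate_matches(d2, d1, "value")
print(out)
```

The last row of `d2` has no match, so all three aggregates are NaN there. The memory cost is proportional to `len(left) * len(right)`, which is the part a numba or groupby-based approach would need to avoid.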

@samukweku samukweku linked a pull request Mar 20, 2025 that will close this issue