Skip to content

Add additional convenience methods: process::{dedupe, extract} #16

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 17 commits into
base: master
Choose a base branch
from

Conversation

seanpianka
Copy link
Collaborator

@seanpianka seanpianka commented Sep 26, 2020

I've added dedupe and extract, along with a single unit-test for both.

However, the dedupe test is failing, as it seems to be returning the shortest duplicate, whereas the "canonical" duplicate is meant to be the longest duplicate (as it contains the most entity information). @logannc, I would appreciate your help debugging this issue... it seems similar to the max_by issue with extract_one that required a reversal of the matches.

I was going to add extract_bests, but it does not seem to differ in implementation (beyond defaults) from extract, so I've commented out the (duplicate) implementation.

Closes: #7
Depends on: #15

@seanpianka seanpianka requested a review from logannc September 26, 2020 06:33
@seanpianka seanpianka self-assigned this Sep 26, 2020
@seanpianka seanpianka added the wip work in progress label Sep 26, 2020
@seanpianka seanpianka marked this pull request as draft September 26, 2020 06:55
Also, add more usages of AsRef<str> trait bounds instead of directly
requesting &str.

Signed-off-by: Sean Pianka <[email protected]>
@seanpianka seanpianka force-pushed the 7-remaining-process-methods branch from a740183 to 77afc9a Compare September 26, 2020 07:44
@seanpianka seanpianka added the 1.0.0 Road to 1.0.0 label Nov 2, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
1.0.0 Road to 1.0.0 wip work in progress
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Implement remaining methods for process module
2 participants