Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Several independent groups #1181

Closed
rderidder-lda opened this issue Jan 18, 2024 · 2 comments
Closed

Several independent groups #1181

rderidder-lda opened this issue Jan 18, 2024 · 2 comments

Comments

@rderidder-lda
Copy link

Another support type question:

say my scenario is that over years, there are Events created, and each Event has participants associated.
Problem is that sometimes participants are duplicated.

I do NOT want to match between events.. but I want to get all the matches WITHIN each event.
How to best do this, knowing there are maybe 1 million events, each with a 4 or 5 participants.

I thought of using the 'EXACT' on the event (event id), and use text/shortstring matching on the participant names...and then let it run on the whole set...
However it seems that even with EXACT, sometimes it will stray away and match across..
Is there a better way or is there a way to be 100% sure it never matches across event ids? Make the Exact field100% required to be identical, and the other fields use the thresholds, etc.

Thanks
R

@rderidder-lda
Copy link
Author

maybe i should use Categorical.. but the categories would be the million event ids.. doesn't seem like its the right thing to do

@rderidder-lda
Copy link
Author

moved to discussion

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Feb 8, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant