Replies: 1 comment 2 replies
https://moj-analytical-services.github.io/splink/demos/examples/duckdb/transactions.html
Hi, I wonder whether Splink would be a good library for identifying duplicate transactions in a dataset, for example duplicate payments identified by amount, date, vendor name, and invoice reference. The expected number of matches would be several orders of magnitude lower than in classic deduplication of a person dataset, so I am not sure the probabilistic model would work as well. Any thoughts on that?