Skip to content

m estimation: where does the initial decision rule (match/not match) come from? #1978

Answered by RobinL
nadd0u asked this question in Q&A
Discussion options

You must be logged in to vote

There are starting values of the m probabilities built into Splink. They can be overridden by manually specifying them in your settings object. But I think the problem you're seeing is less likely to be down to the wrong starting values for m probabilities, and more likely to be down to other issues with your data or model spec, possibly strong correlations between your input columns causing violations of conditional independence, resulting in double counting and hence causing problems with the EM approach.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by nadd0u
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants