-
Hello! I'm asking about estimation because my model is identifying pairs as matches even when the first name is completely different, and I'm concerned that it's because it is inappropriately defining matches when estimating.
Replies: 1 comment
-
Splink has built-in starting values for the m probabilities, and you can override them by specifying them manually in your settings object. But I think the problem you're seeing is less likely to be down to the wrong starting values for the m probabilities, and more likely to be down to other issues with your data or model spec. One possibility is strong correlation between your input columns: that violates the conditional independence assumption, so the same evidence gets double counted, which in turn causes problems for the EM approach.
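To make the double-counting point concrete, here is a rough sketch in plain Python (it does not use Splink) of how per-column Bayes factors (m/u) combine under the conditional independence assumption. The column names, the prior, and every m and u value are made up for illustration and are not Splink defaults; `match_probability` is a hypothetical helper, not a Splink function.

```python
def match_probability(prior, bayes_factors):
    """Combine a prior match probability with per-column Bayes factors (m/u).

    Multiplying the factors together is only valid if the columns are
    conditionally independent given match status.
    """
    odds = prior / (1 - prior)
    for bf in bayes_factors:
        odds *= bf
    return odds / (1 + odds)


# Illustrative numbers only (assumptions, not taken from any real model).
prior = 1 / 1000                      # probability two random records match
first_name_mismatch = 0.05 / 0.99     # m/u for "first name differs": evidence against
surname_match = 0.9 / 0.01            # m/u for "surname agrees": evidence for
postcode_match = 0.9 / 0.005          # m/u for "postcode agrees": evidence for
address_match = 0.9 / 0.005           # m/u for "address agrees": evidence for

# Postcode and address are assumed here to be strongly correlated: if the
# address agrees, the postcode almost always agrees too. Treating them as
# two independent pieces of evidence counts the same information twice.
p_double_counted = match_probability(
    prior,
    [first_name_mismatch, surname_match, postcode_match, address_match],
)

# Counting that shared evidence only once.
p_counted_once = match_probability(
    prior,
    [first_name_mismatch, surname_match, postcode_match],
)

print(f"postcode and address both counted: {p_double_counted:.3f}")
print(f"only one of them counted:          {p_counted_once:.3f}")
```

With these made-up numbers, the pair with a completely different first name scores above 0.95 when both correlated columns are counted and below 0.5 when the shared evidence is counted once. If your model behaves like this, it may be worth checking which of your columns tend to agree or disagree together.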