Skip to content

Is there any way that we can save linker object in spark after training and predicting #827

Answered by RobinL
rapidAmbakar asked this question in Q&A
Discussion options

You must be logged in to vote

To save the outputs you will need to save linker.predict() to a dataframe using:

df_predict.as_pandas_dataframe()

or

df_predict.as_spark_dataframe().write.parquet("output_folder") 

To make it run faster, you could consider using tighter blocking rules, see
https://moj-analytical-services.github.io/splink/demos/02_Blocking.html

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by rapidAmbakar
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants