Registering prediction parquet(s) for clustering with Splink 4 #2327
-
Hi! I was able to do this with Splink 3 but it seems like the functionality has been removed from the docs for Splink 4 so that's why I'm asking here. Basically I have run the prediction step for a few models and saved those predictions as parquet files. I want to create a table with DuckDB which is the UNION ALL (BY NAME) of those parquet files, register that DuckDB table as a Splink data table, and then run |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
Is this what you want? That function should be available at linker.table_management It looks like it's not showing up in the docs because we've forgotten to write a do string, thanks for bringing to our attention! Edit: I've just added a docstring here |
Beta Was this translation helpful? Give feedback.
Is this what you want?
splink/splink/internals/linker_components/table_management.py
Line 147 in 116e2ff
That function should be available at linker.table_management
It looks like it's not showing up in the docs because we've forgotten to write a do string, thanks for bringing to our attention!
Edit: I've just added a docstring here
#2328