Skip to content

Best way to deal with outputs? #2387

Answered by RobinL
LeCodeMinister asked this question in Q&A
Discussion options

You must be logged in to vote

It's generally most efficient to extract the results from the SplinkDataFrame in its native format - as in, native to the backend you've chosen

So if you're using the DuckDBLinker, as a DuckDBPyRelation, using splink_dataframe.as_duckdbpyrelation() or if you're using the Spark backend as a Spark dataframe (using predictions_splink_dataframe.as_spark_dataframe()

You can also get the table/view name from the SplinKDataFrame using predictions_splink_dataframe.physical_name and then just query the database directly

e.g.

con = duckdb.connect()
con = DuckDBAPI(con)
linker = Linker(df, settings, db_api)

predictions = linker.inference.predict()

con.sql(f"select * from {predictions.physical_name…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@LeCodeMinister
Comment options

Answer selected by LeCodeMinister
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants