Is your proposal related to a problem?
The comparison viewer dashboard shows num_example_rows examples for every comparison vector present in df_predict. For sufficiently large datasets and complex models, the number of comparison vectors, and with it the dashboard file, can become prohibitively large (I have an example where the dashboard is 1.4 GB with num_example_rows=2).
Currently, the only way to trim this down is to manipulate df_predict. This is easy if you want to view only comparisons with a match probability between, say, 0.5 and 0.999, but showing only comparison vectors that appear more than N times is harder. Either or both of these filters would be helpful to include in the dashboard function.
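For reference, the manual workaround described above might look roughly like the sketch below. It uses plain pandas and assumes df_predict has been converted to a pandas DataFrame with a match_probability column and one gamma_-prefixed column per comparison. Those column names are my assumption about the prediction output, and trim_predictions is a hypothetical helper, not an existing function. The count is taken after the probability filter, which is only one possible reading of "appears at least N times".

```python
import pandas as pd


def trim_predictions(df, min_probability=None, max_probability=None, min_count=None):
    """Hypothetical helper: filter predictions before building the dashboard.

    Assumes a 'match_probability' column and comparison-vector columns
    whose names start with 'gamma_'.
    """
    out = df
    if min_probability is not None:
        out = out[out["match_probability"] >= min_probability]
    if max_probability is not None:
        out = out[out["match_probability"] <= max_probability]
    if min_count is not None:
        gamma_cols = [c for c in out.columns if c.startswith("gamma_")]
        if not gamma_cols:
            raise ValueError("no gamma_ columns found")
        # Number of rows sharing each comparison vector, aligned to the rows.
        counts = out.groupby(gamma_cols)["match_probability"].transform("size")
        out = out[counts >= min_count]
    return out


# Toy data standing in for df_predict (values are made up).
df = pd.DataFrame(
    {
        "match_probability": [0.99, 0.98, 0.97, 0.60, 0.10, 0.9995],
        "gamma_first_name": [2, 2, 2, 1, 0, 2],
        "gamma_surname": [1, 1, 1, 1, 0, 1],
    }
)
trimmed = trim_predictions(df, min_probability=0.5, max_probability=0.999, min_count=3)
```

On the toy data only the three rows sharing the vector (2, 1) within the probability range are kept. Having to write this by hand for every dashboard is the part that a built-in argument would remove.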
Describe the solution you'd like
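A min_count argument, so that, for example, min_count=100 shows only comparison vectors that appear at least 100 times. This would be one way to keep the dashboard to a more manageable file size.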