[FEAT] Additional argument to filter comparisons shown in comparison viewer dashboard #2318

Open
samnlindsay opened this issue Aug 9, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@samnlindsay
Contributor

Is your proposal related to a problem?

The comparison viewer dashboard shows num_example_rows examples for every comparison vector present in df_predict. For sufficiently large datasets and complex models, the number of comparison vectors can become prohibitively large (I have an example where the dashboard is 1.4 GB with num_example_rows=2).

Currently, the only way to trim this down is to manipulate df_predict before passing it to the dashboard function. That is easy if you want to view comparisons with a match probability between, say, 0.5 and 0.999, but it is harder to show only comparison vectors that appear more than N times. Either or both of these options would be helpful to include in the dashboard function.
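To make the two filters concrete, here is a minimal sketch of the manual workaround in plain Python. It is not Splink code: it assumes the predictions have already been exported as a list of dicts, that comparison levels live in columns prefixed `gamma_`, and that the score column is called `match_probability`. The helper name `filter_predictions` and its parameters are hypothetical.

```python
from collections import Counter


def filter_predictions(rows, min_count=1, min_probability=0.0, max_probability=1.0):
    """Keep rows whose comparison vector occurs at least `min_count` times
    and whose match probability lies in [min_probability, max_probability].

    Hypothetical helper: `rows` is assumed to be a list of dicts exported
    from the predictions table, with `gamma_*` and `match_probability` keys.
    """

    def vector(row):
        # The comparison vector is the tuple of all gamma_* values,
        # read in sorted column order so it is stable across rows.
        return tuple(row[key] for key in sorted(row) if key.startswith("gamma_"))

    # Count how often each comparison vector appears across all rows.
    counts = Counter(vector(row) for row in rows)

    return [
        row
        for row in rows
        if counts[vector(row)] >= min_count
        and min_probability <= row["match_probability"] <= max_probability
    ]


# Tiny made-up example: two rows share a vector, one row is unique.
rows = [
    {"gamma_first_name": 2, "gamma_dob": 1, "match_probability": 0.97},
    {"gamma_first_name": 2, "gamma_dob": 1, "match_probability": 0.97},
    {"gamma_first_name": 0, "gamma_dob": 1, "match_probability": 0.12},
]

frequent = filter_predictions(rows, min_count=2)
in_range = filter_predictions(rows, min_probability=0.5, max_probability=0.999)
```

On a real predictions table the same counting step would more likely be a group-by on the gamma columns in SQL or a dataframe library; the point is only that the count filter needs an extra aggregation pass, which is why it is more awkward to do by hand than the probability filter.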

Describe the solution you'd like

A min_count argument would be one way to keep the file size manageable, e.g. min_count=100 to drop comparison vectors that appear fewer than 100 times.

comparison_viewer_dashboard(
    df_predict, 
    out_path, 
    overwrite=False, 
    num_example_rows=2, 
    return_html_as_string=False,
    min_count=1
)
@samnlindsay samnlindsay added the enhancement New feature or request label Aug 9, 2024