
p-values for bootstrapped performance comparison #376

Open
ndiamant opened this issue Jul 29, 2020 · 1 comment
Labels
enhancement New feature or request good first issue Good for newcomers

@ndiamant (Contributor)

What
plots._protected_subplots makes box plots of model performance per protected class. It should also report a p-value for the null hypothesis that performance is the same across classes.

Why
What we ultimately want to know is whether performance differs across classes. The box plots give a rough idea, but without a p-value it is unclear what conclusion to draw from them.

How
Figure out which p-value to calculate, then add a helper function to compute it in plots.py. Performance is currently evaluated in plots._bootstrap_performance and plots._performance_by_index.
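A minimal sketch of what such a helper could look like. The function name, signature, and the shape of the bootstrapped scores are assumptions for illustration, not the project's actual API; it uses the Kruskal-Wallis H-test because it generalizes to more than two protected classes.

```python
# Hypothetical helper for plots.py; names and data shapes are assumptions.
from typing import Dict, List

from scipy import stats


def performance_p_value(scores_by_class: Dict[str, List[float]]) -> float:
    """Test whether bootstrapped performance differs across protected classes.

    Uses the Kruskal-Wallis H-test (a non-parametric one-way ANOVA), which
    handles two or more classes; with exactly two classes it is close in
    spirit to a Mann-Whitney U test.
    """
    groups = [scores for scores in scores_by_class.values() if scores]
    if len(groups) < 2:
        raise ValueError("Need at least two classes to compare performance.")
    _, p_value = stats.kruskal(*groups)
    return p_value
```

One caveat with this design: bootstrap replicates of the same metric are not independent samples, so a p-value computed directly on them should be interpreted with care.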

Acceptance Criteria
plots._protected_subplots calculates and displays p-values for performance across classes being the same.

@ndiamant ndiamant added enhancement New feature or request good first issue Good for newcomers labels Jul 29, 2020
@lucidtronix (Collaborator)

Should we use a Mann-Whitney or a chi-squared test here?
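For reference, the two candidate tests answer slightly different questions: Mann-Whitney suits continuous per-bootstrap scores from two classes, while chi-squared suits count data such as correct/incorrect tallies per class. A hedged sketch with made-up numbers (not project data):

```python
# Illustrative comparison of the two candidate tests; all data is invented.
from scipy import stats

# Hypothetical bootstrapped AUC samples for two protected classes.
class_a = [0.80, 0.82, 0.81, 0.79, 0.83, 0.80]
class_b = [0.71, 0.74, 0.72, 0.70, 0.73, 0.72]
_, p_mw = stats.mannwhitneyu(class_a, class_b, alternative="two-sided")

# Hypothetical contingency table: rows = classes, cols = correct/incorrect.
table = [[90, 10], [75, 25]]
chi2, p_chi2, dof, _ = stats.chi2_contingency(table)
```

Since _protected_subplots may compare more than two classes, a multi-group test such as Kruskal-Wallis could also be worth considering.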
