Possible Feature Request(?): AUC metric and optimism via bootstrapping
Describe the workflow you want to enable
To compute AUC with confidence intervals, one can evaluate on held-out data from cross-validation (e.g. with any of the <X>-Fold splitters in sklearn). However, this procedure can be problematic in low-sample settings. In those settings it is sometimes suggested to instead use bootstrapping to estimate the "optimism" of the model, which can then be used to correct the apparent roc_auc_score.
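For context, the workflow available today looks roughly like this (a minimal sketch on toy data, just for illustration):

    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score

    # toy binary classification data, purely for illustration
    X, y = make_classification(n_samples=100, random_state=0)

    # AUC on held-out folds; the spread across folds gives a rough confidence interval
    scores = cross_val_score(LogisticRegression(max_iter=1000), X, y,
                             cv=5, scoring="roc_auc")
    print(scores.mean(), scores.std())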
Describe your proposed solution
Under the inspection or utils module, perhaps, a compute_metric_optimism() function could be added. This could later be extended to metrics beyond just AUC.
Some pseudo-code:
    from sklearn.base import BaseEstimator, clone
    from sklearn.utils import check_random_state, resample

    def compute_metric_optimism(estimator: BaseEstimator, X, y, scoring,
                                n_bootstrap: int = 100, random_state=None):
        rng = check_random_state(random_state)
        # compute the original (apparent) metric on the original data
        fitted = clone(estimator).fit(X, y)
        orig_score = scoring(fitted, X, y)
        optimisms = []
        for boot_idx in range(n_bootstrap):
            # fit the model on a bootstrap resample of the data
            X_boot, y_boot = resample(X, y, random_state=rng)
            boot_model = clone(estimator).fit(X_boot, y_boot)
            # optimism of this sample: score on the bootstrap sample minus score on the original data
            optimisms.append(scoring(boot_model, X_boot, y_boot) - scoring(boot_model, X, y))
        # compute optimism: average over the bootstrap samples
        optimism = sum(optimisms) / n_bootstrap
        # compute the optimism-adjusted metric
        return orig_score - optimism
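A hypothetical usage sketch of the helper above (names like make_classification are just for illustration; get_scorer("roc_auc") yields a scorer callable with the (estimator, X, y) signature assumed above):

    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import get_scorer

    X, y = make_classification(n_samples=100, random_state=0)
    corrected_auc = compute_metric_optimism(
        LogisticRegression(max_iter=1000), X, y,
        scoring=get_scorer("roc_auc"), n_bootstrap=200, random_state=0,
    )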
Describe alternatives you’ve considered, if relevant
If maintainers feel it is not necessary, it is always possible to do this outside of sklearn, either with third-party packages or a self-implemented version.
Additional context
For some reference on the procedure in R: https://thestatsgeek.com/2014/10/04/adjusting-for-optimismoverfitting-in-measures-of-predictive-ability-using-bootstrapping/
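In short (my summary of the procedure described there, with $S$ the metric, $\hat f$ the model fit on the original data $D$, and $\hat f_b$ the model refit on the $b$-th bootstrap sample $D_b$):

$$\widehat{O} = \frac{1}{B}\sum_{b=1}^{B}\Big[S(\hat f_b, D_b) - S(\hat f_b, D)\Big], \qquad S_{\text{corrected}} = S(\hat f, D) - \widehat{O}$$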
Happy to try a PR if there is interest in this.
Top GitHub Comments
Considering that cross_validate doesn't require the splits to be disjoint sets, and that it returns the test scores as well as the train scores, a very short example in the example gallery would be welcome.

Closing as WONTFIX; please refer to the comments on the PR.
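For concreteness, one way such a gallery example might look (a sketch, not the maintainers' exact proposal; it passes an iterable of bootstrap (train, test) index pairs as cv, with the full original data as the "test" set):

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_validate

    X, y = make_classification(n_samples=200, random_state=0)
    n_samples = len(y)
    rng = np.random.RandomState(0)

    # bootstrap "splits": train on a resample, evaluate on the full original data
    # (train and test indices are deliberately not disjoint)
    splits = [(rng.randint(0, n_samples, n_samples), np.arange(n_samples))
              for _ in range(100)]

    res = cross_validate(LogisticRegression(max_iter=1000), X, y, cv=splits,
                         scoring="roc_auc", return_train_score=True)

    # train_score is the (optimistic) score on each bootstrap sample,
    # test_score the score of the same model on the original data
    optimism = (res["train_score"] - res["test_score"]).mean()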