Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[BUG] Stacking classifier cannot use Thresholder function - no .predict_proba

See original GitHub issue

Description:

I’m able to use the thresholder on sklearn’s voting classifer, but not on the stacking classifier. It throws this error, which I believe is in error. StackingClassifier does have predict_proba. Maybe I’m missunderstanding the use case, but this seems to fit.

ValueError: The Thresholder meta model only works on classifcation models with .predict_proba.

Code for reproduction (using the sklearn sample data for StackingClassifier):

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import LinearSVC
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.ensemble import StackingClassifier

X, y = load_iris(return_X_y=True)
estimators = [
    ('rf', RandomForestClassifier(n_estimators=10, random_state=42)),
    ('svr', make_pipeline(StandardScaler(), LinearSVC(random_state=42)))]
clf = StackingClassifier(    estimators=estimators, final_estimator=LogisticRegression())
clf.fit(X, y)

a = Thresholder(clf, threshold=0.2)
a.fit(X, y)
a.predict(X)

Full trace:

ValueError                                Traceback (most recent call last)
<ipython-input-26-1b89dbfa16b8> in <module>
     16 
     17 a = Thresholder(clf, threshold=0.2)
---> 18 a.fit(X_train_std, np.ceil(y_train[targets[2]]))
     19 a.predict(X_train_std)

~\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.9_qbz5n2kfra8p0\LocalCache\local-packages\Python39\site-packages\sklego\meta\thresholder.py in fit(self, X, y, sample_weight)
     54         self.estimator_ = clone(self.model)
     55         if not isinstance(self.estimator_, ProbabilisticClassifier):
---> 56             raise ValueError(
     57                 "The Thresholder meta model only works on classifcation models with .predict_proba."
     58             )

ValueError: The Thresholder meta model only works on classifcation models with .predict_proba.

Issue Analytics

State:
Created a year ago
Comments:16 (11 by maintainers)

Top GitHub Comments

2reactions

koaningcommented, Apr 2, 2022

The PR is now merged into the main branch. I’d like to give it another week of waiting to see if other PRs come in. If so, we may be able to batch together a few fixes into new releases on PyPI.

@MarkusDegen, thanks for the PR!

1reaction

MarkusDegencommented, Mar 30, 2022

Unit test shows same error. I guess i need to slim it down a bit and add some asserts. Need to learn what the stacking classifier actually does

Top Results From Across the Web

Stacking classifier has no attribute predict_proba #633 - GitHub

Current stacking classifiers would fail to stack non predict_proba compatible base estimators when use_proba is set to True.

python - SciKit learn predict_proba - move threshold from .5 to ...

I read that SciKit learn uses the . 5 as the threshold to make a decision from the predict_proba() score. I'm trying to...

A Gentle Introduction to Threshold-Moving for Imbalanced ...

First, let's fit a model and calculate a ROC Curve. We can use the make_classification() function to create a synthetic binary classification ......

sklearn.ensemble.StackingClassifier

Stacking allows to use the strength of each individual estimator by using their output as input of a final estimator.

Predict_proba for Binary classifier in Tensorflow

Last layer wrapped with Sigmoidal Function and just returning a single value. For my prediction I just set a standard threshold value of...

Troubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.

Start Free

Top Related Reddit Thread

No results found

Top Related Tweet

No results found

Top Related Dev.to Post

No results found

[BUG] Stacking classifier cannot use Thresholder function - no .predict_proba

Issue Analytics

Top GitHub Comments

Top Results From Across the Web

Top Related Medium Post

Top Related StackOverflow Question

Troubleshoot Live Code

Top Related Reddit Thread

Top Related Hackernoon Post

Top Related Tweet

Top Related Dev.to Post

Top Related Hashnode Post

ZeroInflatedRegressor with AttributeError

[FEATURE] Run with given regressor instead of raising warning in ZeroInflatedRegressor