Replace deprecated Boston housing prices dataset with California housing dataset

Motivation

Since scikit-learn 1.0.0, the Boston housing prices dataset has been deprecated because of an ethical problem. The dataset will be removed in scikit-learn 1.2.0. See also the scikit-learn documentation page.

The scikit-learn documentation suggests using the California housing dataset instead of the Boston dataset.

Description

Please replace load_boston with fetch_california_housing.
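
As a rough illustration of the requested change, the sketch below swaps the loader call. The helper name `load_housing_data` and its `fetch` parameter are hypothetical and exist only for this example; they are not part of Optuna or scikit-learn. It assumes the existing code calls `load_boston(return_X_y=True)`, which may not hold for every call site.

```python
def load_housing_data(fetch=None):
    """Return (X, y) from the California housing dataset.

    `fetch` is a hypothetical hook that lets a caller pass a different
    loader (for example a stub in tests). By default the scikit-learn
    loader is used.
    """
    if fetch is None:
        # Imported lazily so the helper can be defined without scikit-learn.
        from sklearn.datasets import fetch_california_housing

        fetch = fetch_california_housing

    # Before (deprecated since scikit-learn 1.0):
    #     from sklearn.datasets import load_boston
    #     X, y = load_boston(return_X_y=True)
    X, y = fetch(return_X_y=True)
    return X, y


# Example call (downloads the data on first use):
#     X, y = load_housing_data()
```

Unlike `load_boston`, `fetch_california_housing` downloads the data on first use and caches it locally, so tests and examples that previously ran offline may need network access or a pre-populated cache. The feature count and target scale also differ, so any hard-coded shapes or expected scores would need to be revisited.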

Alternatives (optional)

Additional context (optional)

Optuna does not use scikit-learn >= 1.0.0 yet. See related issue: https://github.com/optuna/optuna/pull/2953.

Issue Analytics

  • State: closed
  • Created: 2 years ago
  • Reactions: 2
  • Comments: 11 (4 by maintainers)

Top GitHub Comments

2 reactions
avats-dev commented, Sep 30, 2021

Hi @nzw0301, I want to contribute to Optuna, and I aim to keep contributing over the long term. I’m thinking of starting with this issue. As I understand it, I have to replace the Boston housing dataset with the California housing dataset, as mentioned in scikit-learn’s deprecation warning, and the California housing dataset is also backward compatible down to roughly version 0.2 of scikit-learn.

So how should I proceed? Should I replace every instance of load_boston in the repo?

1 reaction
nzw0301 commented, Oct 16, 2021

Great. Thank you for letting me know about the unlabelled issue. I’ve added the Hacktoberfest label to this issue.