Replace deprecated Boston housing prices dataset with California housing dataset
Motivation
Since scikit-learn 1.0.0, the Boston housing prices dataset has been deprecated because of an ethical problem. The dataset will be removed in scikit-learn 1.2.0. Please see also scikit-learn's page on the dataset.
The scikit-learn documentation suggests using the California housing dataset instead of the Boston dataset.
Description
Please replace load_boston with fetch_california_housing.
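A minimal sketch of what the replacement could look like at a call site. The helper name load_housing_data and its fetch parameter are hypothetical and only there for illustration; they are not part of Optuna or scikit-learn. Only fetch_california_housing and its return_X_y argument come from scikit-learn.

```python
# Before (deprecated since scikit-learn 1.0, removed in 1.2):
#     from sklearn.datasets import load_boston
#     X, y = load_boston(return_X_y=True)


def load_housing_data(fetch=None):
    """Return (X, y) for a housing-price regression task.

    `fetch` is a hypothetical hook so the loader can be swapped out,
    for example in tests that should not touch the network.
    """
    if fetch is None:
        # Imported lazily: fetch_california_housing downloads the data
        # on first use and caches it locally afterwards.
        from sklearn.datasets import fetch_california_housing

        fetch = fetch_california_housing

    # return_X_y=True yields a (data, target) pair, the same shape of
    # result that load_boston(return_X_y=True) used to give.
    X, y = fetch(return_X_y=True)
    return X, y
```

Calling load_housing_data() with no arguments uses the scikit-learn loader. Note that the California dataset has 8 features rather than Boston's 13, so any code that hard-codes feature counts or column names would likely need adjusting as well.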
Alternatives (optional)
Additional context (optional)
Optuna does not use scikit-learn >= 1.0.0 yet. See the related issue: https://github.com/optuna/optuna/pull/2953.
Issue Analytics
- State:
- Created 2 years ago
- Reactions:2
- Comments:11 (4 by maintainers)
Hi @nzw0301, I want to contribute to Optuna, and I have a long-term aim of contributing here. I'm thinking of starting with this one. As I understand it, I have to replace the Boston housing dataset with the California housing dataset, as mentioned in the deprecation warning from scikit-learn, and the California housing dataset is also backward compatible down to roughly version 0.2 of scikit-learn.
So how should I proceed further? Should I replace every instance of load_boston in the repo?

Great. Thank you for letting me know about the unlabelled issue. I've added the Hacktoberfest label to this issue.