possible error in user guide for log loss
See original GitHub issueDescription
The user guide for log loss (section 3.3.2.11) says (in the second paragraph, just before the equation):
the log loss per sample is the negative log-likelihood of the classifier given the true label
I think that this is backwards; I think that it’s the negative log-likelihood of the true label given the probability predictions from the classifier. The equation in the subsequent line has the conditioning on the predictions. Also, the documentation for the function metrics.log_loss
describes it as I wrote it.
Steps/Code to Reproduce
Expected Results
Actual Results
Versions
I’m looking at the documentation for sklearn 0.22.1.
Issue Analytics
- State:
- Created 4 years ago
- Reactions:1
- Comments:9 (7 by maintainers)
Top Results From Across the Web
Intuition behind Log-loss score - Towards Data Science
A model with perfect skill has a log-loss score of 0. In other words, the model predicts each observation's probability as the actual...
Read more >Log Loss Function Explained by Experts - Dasha.AI
Logarithmic loss indicates how close a prediction probability comes to the actual/corresponding true value. As a result, it turns out that ...
Read more >Log Loss Function: Machine Learning error measurement metric
Log loss is generally called as loss function and it is the measure of error. We want our error to be as small...
Read more >Log Loss - Logistic Regression's Cost Function for Beginners
Log loss is a logistic regression cost function and a classification metric based on probabilities. We cover the math behind the log loss....
Read more >What's considered a good log loss? - Cross Validated
So maybe Logloss would not be the best metric to use. binary classification logloss, unbalanced - chart. b. Three-class case.
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
From the wikipedia link you provided:
Saying “given the data” is just a way of saying that the dataset is considered fixed, while what varies are the parameters of the models
The likelihood is a function of the parameters of the model given some data, so IMO the user-guide formulation is correct and we should update the docstring