Suspecious bug
See original GitHub issueI’ve noticed an abnormal phenomenon for seq2seq model: the validation loss is always lower than training loss, both on Twitter and OpenSubtitles dataset.
And I found this part of codes quite suspicious: https://github.com/facebookresearch/ParlAI/blob/09351eec1076319a9f5ea0f6a0c242d5f35fa43d/parlai/agents/seq2seq/seq2seq.py#L403-L411
So here we see when doing pure prediction, we don’t use true labels ys
, but when computing loss we use, which I think will bias the validation loss. Are there any reasons for computing loss like this?
Issue Analytics
- State:
- Created 6 years ago
- Comments:7 (7 by maintainers)
Top Results From Across the Web
Identifying Bugs and Bug Bites - WebMD
Learn to identify ticks, poisonous spiders, fleas, chiggers, and other bugs in this WebMD slideshow. See what their bites and stings look ...
Read more >A Suspicious Insect
Is a little-known predator insect that lives its life underwater in the tropics the cause of an outbreak of a mysterious flesh-eating ...
Read more >How to Identify a Bed Bug Infestation
Most people become suspicious of a bed bug infestation when they find unexplained bites on their bodies. Most commonly a per- son will...
Read more >CWE-546: Suspicious Comment (4.9) - MITRE
Many suspicious comments, such as BUG, HACK, FIXME, LATER, LATER2, TODO, in the code indicate missing security functionality and checking.
Read more >Remove malware or unsafe software - Android - Google Support
Malware is unsafe or unwanted software that may steal personal info or harm your device. You may have malware on your device if:...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
And thanks @alexholdenmiller ! With
dropout=0.0
, the validation loss is roughly the same as training loss.Updated perplexity scoring