MriModule Validation Loss Logged Incorrectly
See original GitHub issueSteps to reproduce:
- Run “train_unet_demo.py” in fastmri/fastmri_examples/unet
- Preferably with a specified data folder that only has a single data point for train and val, because it is going to error after a full training and validation epoch
- Observe the following error:
pytorch_lightning.utilities.exceptions.MisconfigurationException: ModelCheckpoint(monitor='val_loss') not found in the returned metrics: ['loss', 'validation_loss', 'val_metrics/nmse', 'val_metrics/ssim', 'val_metrics/psnr']. HINT: Did you call self.log('val_loss', tensor) in the LightningModule?
The issue appears to be because MriModule was changed to log 'validation_loss'
instead of 'val_loss'
in the method validation_epoch_end
. Renaming it back to 'val_loss'
fixes the issue. The change happened in this commit.
Issue Analytics
- State:
- Created 3 years ago
- Comments:5 (3 by maintainers)
Top Results From Across the Web
Why my training and validation loss is not changing?
Your weights have diverged during training, and the network as a result is essentially broken. As it consists of ReLUs, I expect the...
Read more >What does it mean when train and validation loss diverge from ...
Possible explanations. Coding error; Overfitting due to differences in the training / validation data; Skewed classes (and differences in ...
Read more >Your validation loss is lower than your training loss? This is why!
During validation and testing, your loss function only comprises prediction error, resulting in a generally lower loss than the training set. Image by...
Read more >Why is my validation loss lower than my training loss?
Reason #2: Training loss is measured during each epoch while validation loss is measured after each epoch.
Read more >Overfit and underfit | TensorFlow Core
TensorBoard to generate TensorBoard logs for the training. ... If the validation metric is going in the wrong direction, the model is clearly...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Okay it’s merged.
Hello, yes that fixed the issue. Thanks!