CLEVR pretrained model gives FID 22
See original GitHub issueHi, kudos for great work!
I’ve just noticed that with recommended preprocessing and evaluation, the metrics on gdrive:cityscapes work as expected (FID ~5.2), while for CLEVR exactly two same lines:
python prepare_data.py --clevr --max-images 100000
python run_network.py --eval --gpus 0 --expname clevr-exp --dataset clevr --pretrained-pkl gdrive:clevr-snapshot.pkl
give ~22 FID, not 9.2. Can you please double-check if the provided snapshot is correct? Or am I missing smth here?
Thanks in advance!
Issue Analytics
- State:
- Created 2 years ago
- Comments:8 (6 by maintainers)
Top Results From Across the Web
ashkamath/mdetr
This repository contains code and links to pre-trained models for MDETR (Modulated DETR) for pre-training on data having aligned text and images with...
Read more >deep learning - Which is the fastest image pretrained model?
The answer will depend on some things such as your hardware and the image you process. Additional, we should distinguish if you are...
Read more >CLEVR-X: A Visual Reasoning Dataset for Natural ...
The CLEVR dataset [27] was proposed as a diagnostic dataset to inspect the visual reasoning of VQA models. Multiple frameworks have been ...
Read more >Image Recognition with Transfer Learning (98.5%)
Use transfer learning to easily classify dog and cat pictures with a 98.5% accuracy.
Read more >CLEVR: A Diagnostic Dataset for Compositional Language ...
We use CLEVR to analyze a suite of VQA models and discover weaknesses that are not widely known. For exam- ple, we find...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Hi, thank you very much for pointing that out! I’ll check that today. I verified all scores locally so there must be a bug / incorrect snapshot file. Will get back shortly!
Alright I uploaded the data (the difference between the locally processed data that leads to the FID score 12 is that the images ratio height/weight in the data is 2/3 while the pretrained model was trained on images with ratio 3/4. You can try again to run
python prepare_data.py --clevr --max-images 100000
should take a few minutes only (because will just download the data) and then another 20min to run thepython run_network.py --eval --gpus 0 --expname clevr-exp --dataset clevr --pretrained-pkl gdrive:clevr-snapshot.pkl
. Let me know if you’re still getting higher FID!