Flickr30k finetune results do not match the provided checkpoint
Hi authors,
I took the provided pretrained 200k checkpoint and finetuned it on Flickr30k. The resulting IR and TR scores are 64.5 and 81.7. The TR score is lower than the one reported in the paper. My finetuning command is:
$PYTHONBIN run.py with data_root=vilt_dataset/ \
num_gpus=8 num_nodes=1 task_finetune_irtr_f30k \
per_gpu_batchsize=4 load_path="weights/vilt_200k.ckpt" \
exp_name="f30k/finetune_official"
I also tested the provided vilt_irtr_f30k.ckpt, and the results are good, with IR=65.3 and TR=83.5. May I ask what process was used to obtain vilt_irtr_f30k.ckpt?
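For readers comparing the numbers above, here is a minimal sketch of how retrieval recall@k can be computed from an image-caption similarity matrix. It assumes the IR and TR scores are recall@1, which this thread does not state explicitly. The function name `recall_at_k` and the toy similarity values are hypothetical and are not taken from the ViLT code.

```python
def recall_at_k(scores, gt, k=1):
    """Fraction of queries whose top-k candidates contain a correct match.

    scores[q][c] is the similarity of query q to candidate c.
    gt[q] is the set of correct candidate indices for query q.
    """
    hits = 0
    for q, row in enumerate(scores):
        # Rank candidate indices from most to least similar.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if any(c in gt[q] for c in ranked[:k]):
            hits += 1
    return hits / len(scores)


# Toy data (made up): 2 images, 4 captions, 2 captions per image.
sim = [
    [0.9, 0.2, 0.1, 0.8],  # image 0 vs captions 0..3
    [0.3, 0.7, 0.6, 0.4],  # image 1 vs captions 0..3
]

# Text retrieval (TR): each image is a query, captions are candidates.
tr_gt = [{0, 1}, {2, 3}]
tr_r1 = recall_at_k(sim, tr_gt, k=1)

# Image retrieval (IR): each caption is a query, images are candidates,
# so the similarity matrix is transposed.
sim_t = [list(col) for col in zip(*sim)]
ir_gt = [{0}, {0}, {1}, {1}]
ir_r1 = recall_at_k(sim_t, ir_gt, k=1)

print(f"TR R@1 = {tr_r1:.4f}, IR R@1 = {ir_r1:.4f}")
```

Because each query either hits or misses, a single query flipping its top-1 result moves the score by 1 / (number of queries), so small differences between runs can come from only a few borderline examples.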
Issue Analytics
- State:
- Created 2 years ago
- Comments:10 (3 by maintainers)
Top GitHub Comments
@JACKHAHA363 Sure, you can grab it here: https://www.dropbox.com/s/lcqmbx587szaox3/vilt_100k_wwm_pretrain.ckpt?dl=0 (the link will expire at some point)
Hi @dandelin, I’m sorry to say that the results are still puzzling. Last night, when I changed the precision to 32 during evaluation, two runs gave similar but NOT identical results: one was 0.6480 (IR) / 0.8370 (TR), and the other was 0.6460 (IR) / 0.8370 (TR). Actually, the seed is fixed to exactly 0. I have no idea what causes the difference. Y_Y
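One possible contributor, not confirmed anywhere in this thread, is that floating-point addition is not associative. If any reduction sums its terms in a different order between runs (for example, a parallel GPU kernel), the result can differ in the last bits even with the same seed, and a tiny change in a similarity score can flip a borderline ranking. The pure-Python sketch below only illustrates the arithmetic effect; the helper `accumulate` is hypothetical and this is not ViLT or PyTorch code.

```python
def accumulate(values):
    """Sum values strictly in the order given, one addition at a time."""
    total = 0.0
    for v in values:
        total += v
    return total


values = [0.1, 0.2, 0.3]

# Same numbers, two different summation orders.
left_to_right = accumulate(values)            # (0.1 + 0.2) + 0.3
right_to_left = accumulate(reversed(values))  # (0.3 + 0.2) + 0.1

print(left_to_right == right_to_left)
print(abs(left_to_right - right_to_left))
```

If this is what is happening, `torch.use_deterministic_algorithms(True)` may help locate the operations involved, since PyTorch raises an error for operations that have no deterministic implementation.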