question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Flickr30k Finetune results does not match the provided checkpoint

See original GitHub issue

Hi authors,

I take the provided pretrained 200k checkpoint and did the finetuning of flickr30k. The IR and TR scores after are 64.5 and 81.7. The TR score lower than the one in the paper. My finetuning command is

$PYTHONBIN run.py with data_root=vilt_dataset/ \
        num_gpus=8 num_nodes=1 task_finetune_irtr_f30k \
        per_gpu_batchsize=4 load_path="weights/vilt_200k.ckpt" \
        exp_name="f30k/finetune_official" 
Screen Shot 2021-06-15 at 12 53 24 PM

I also test the given vilt_irtr_f30k.ckpt and the results is good, with IR=65.3, TR=83.5. Can I ask what is the process of getting vilt_irtr_f30k.ckpt?

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:10 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
dandelincommented, Jun 16, 2021
0reactions
byougertcommented, Dec 31, 2021

Hi, @dandelin I’m sorry to say that the result seems still puzzled. Last night, when I changed precision to 32 during evaluation, two similar but NOT SAME results appeared, which showed one was 0.6480(ir)/0.8370(tr) but the other was 0.6460(ir)/0.8370(tr). Acatlly, seed is exactly fixed to 0. I have no idea what causes the differece. Y_Y

Read more comments on GitHub >

github_iconTop Results From Across the Web

Finetuning BERT on custom data - tensorflow - Stack Overflow
But I am getting shape mismatch with tensor output_bias from checkpoint reader error as the check-point model has 5 classes and my custom ......
Read more >
Flickr30k Entities: Collecting Region-to-Phrase ...
Unfortunately, due to a lack of datasets that provide not only paired sentences and images, but detailed grounding of specific phrases in image...
Read more >
ELEVATER: A Benchmark and Toolkit for Evaluating ... - arXiv
To establish baseline results on ELEVATER, we evaluate the pre-trained model checkpoints in Table 2. More details of the checkpoints are ...
Read more >
UniTAB: Unifying Text and Box Outputs for Grounded Vision ...
The gray tokens in the task-agnostic output sequence are predictions not used ... We report the main “Pre-finetuning” results in Section 4.2, ...
Read more >
Lightweight Image Captioning Prompted with Retrieval ...
Preprints and early-stage research may not have been peer reviewed yet. ... finetuned to a downstream task such as image captioning,.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found