question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Reproduce the results on CoLA

See original GitHub issue

I try to reproduce the CoLA results reported in the BERT paper but the numbers are far from the reported one. My best mcc (BERT large) for dev is 64.79% and the test result is 56.9% while the reported test result is 60.5%. The learning rate is 2e-5 and the total number of epochs is 5. For BERT base,the result is also lower by 3-5%.

As the paper said, for BERTLARGE we found that fine-tuning was sometimes unstable on small data sets (i.e., some runs would produce degenerate results), so we ran several random restarts and selected the model that performed best on the Dev set.

I also tried several restarts with different learning rates and random seeds but it seems no improvement. I’m quite confused for the reproduction. Any suggestions would be greatly appreciated.

Issue Analytics

  • State:closed
  • Created 5 years ago
  • Comments:6 (2 by maintainers)

github_iconTop GitHub Comments

2reactions
Iwontbecreativecommented, Apr 5, 2019

Cola is probably one of the most unstable tasks for BERT. For us it mostly boiled down to running many seeds. If all you care about is a good pre-trained model checkpoint, we have a 65 / 61 run at https://github.com/zphang/bert_on_stilts

0reactions
bgg11117commented, Jul 22, 2019

Hi @cooelf, what parameter number did you change in order to fit a better result, thanks!

Read more comments on GitHub >

github_iconTop Results From Across the Web

What Is a Cost-of-Living Adjustment (COLA), and How Does It ...
A cost-of-living adjustment (COLA) is made to Social Security and Supplemental Security Income to adjust benefits to counteract the effects of inflation.
Read more >
cola: an R/Bioconductor package for consensus partitioning ...
cola provides a complete set of tools for comprehensive subgroup analysis, including partitioning, signature analysis, functional enrichment, as ...
Read more >
Effects of cola intake on fertility: a review
In this review, we introduce the cola effects on reproduction including pregnancy miscarriages, ovulatory and menstrual disorders, and reduced semen quality.
Read more >
Reproduction Coca-Cola Thermometers for sale | eBay
Get the best deals on Reproduction Coca-Cola Thermometers when you shop the largest online selection at eBay.com. Free shipping on many items |...
Read more >
Representing Coca-cola input-output variables - ResearchGate
Download scientific diagram | Representing Coca-cola input-output variables from ... predict, and reproduce the finished product's quality; hence, ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found