question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Hi guys, I have been trying to run the Bing experiment but it seems I can’t for now.

"datasets": {
--
  | "wiki_pretrain_dataset": "/data/bert/bnorick_format/128/wiki_pretrain",
  | "bc_pretrain_dataset": "/data/bert/bnorick_format/128/bookcorpus_pretrain"
  | },

I see this stuff is missing to fully validate the code.

Issue Analytics

  • State:open
  • Created 4 years ago
  • Comments:28 (11 by maintainers)

github_iconTop GitHub Comments

6reactions
jeffracommented, Jun 9, 2020

Hi @Rachnas and @tomekrut, we have uploaded our pre-processing script for the raw bookcorpus and wikipedia datasets to get them into our numpy compatible format. We haven’t written up a tutorial yet on how to use them but feel free to check out the script here: https://github.com/microsoft/DeepSpeedExamples/blob/jeffra/bert_preprocessing/bing_bert/turing/bert_pretrain_data.py

4reactions
tjruwasecommented, Jul 24, 2020

@piyushghai We are pleased to announce that support for training Bing BERT with Nvidia dataset, #27. Please give it a try.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Bing is Now Utilizing BERT at a Larger Scale Than Google
Bing revealed today that it has been using BERT in search results before Google, and it's also being used at a larger scale....
Read more >
Bing delivers its largest improvement in search experience ...
A distilled three-layer BERT model serving latency on twenty CPU cores was initially benchmarked at 77ms per inference. However, since these ...
Read more >
Bing says it has been applying BERT since April
Bing has been using BERT to improve the quality of search results since April, Microsoft has stated. The transformer models are now applied ......
Read more >
Hey; Bing Has Been Doing The BERT Thing Since April
First, Google invented BERT, Bing just added it to its search this summer. Second, BERT is not the only NLP ML-model Google created...
Read more >
Bing Says It Is Applying BERT Since April 2019 | Infidigit
Bing Says it is Utilizing a BERT-like Model for Every Search Query Globally ... Bing mentioned that it is using a large transformer...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found