Contributing trained Greek<->English NMT models implemented with fairseq

Hi there, quick question that I couldn’t answer by searching the docs:

I trained an EL-EN (Greek to English) and an EN-EL machine translation model using the fairseq implementation of the transformer_iwslt_de_en architecture on ~6GB of parallel corpora. Given that the models report a better BLEU score compared to the existing SotA, I would like to share them somehow. I thought that fairseq might offer a huggingface-like way to upload trained models but I couldn’t find any, so I would appreciate any guidance.

If there’s a straightforward way to convert and upload these as huggingface models it would be great!

Many thanks!

Issue Analytics

  • State: closed
  • Created 3 years ago
  • Comments: 15 (8 by maintainers)

Top GitHub Comments

stas00 commented, Nov 2, 2020 (1 reaction)

FSMT already does moses+bpe. No pre- or post-processing is required.
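
For readers who have not used FSMT before, here is a minimal sketch of what that looks like from the Hugging Face `transformers` side. It assumes `transformers`, `torch`, and `sacremoses` are installed. The checkpoint name `facebook/wmt19-en-de` is only a stand-in for illustration, not one of the Greek<->English models from this issue, and the `translate` helper is a hypothetical name. Raw text goes in and detokenized text comes out, because the tokenizer handles the Moses and BPE steps internally.

```python
def translate(text: str, model_name: str = "facebook/wmt19-en-de") -> str:
    """Translate `text` with an FSMT checkpoint (illustrative helper, not part of any library)."""
    # Imported inside the function so that merely defining this helper does not
    # require transformers/torch to be installed.
    from transformers import FSMTForConditionalGeneration, FSMTTokenizer

    # Assumption: `model_name` points at an FSMT-format checkpoint on the
    # Hugging Face Hub or in a local directory.
    tokenizer = FSMTTokenizer.from_pretrained(model_name)
    model = FSMTForConditionalGeneration.from_pretrained(model_name)

    # No manual Moses tokenization or BPE is applied here: the tokenizer
    # performs both when encoding and reverses them when decoding.
    input_ids = tokenizer.encode(text, return_tensors="pt")
    outputs = model.generate(input_ids)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

Calling `translate("Machine learning is great, isn't it?")` downloads the stand-in checkpoint on first use. Swapping in a converted EL-EN or EN-EL checkpoint should only require changing `model_name`, assuming the conversion produced an FSMT-compatible model directory.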

lighteternal commented, Nov 8, 2020 (0 reactions)

Closing this, as it’s solved by @stas00 in #8374
