Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Contributing trained Greek<->English NMT models implemented with fairseq

See original GitHub issue

Hi there, quick question that I couldn’t answer by searching the docs:

I trained an EL-EN (Greek to English) and an EN-EL machine translation model using the fairseq implementation of the transformer_iwslt_de_en architecture on ~6GB of parallel corpora. Given that the models report a better BLEU score compared to the existing SotA, I would like to share them somehow. I thought that fairseq might offer a huggingface-like way to upload trained models but I couldn’t find any, so I would appreciate any guidance.

If there’s a straightforward way to convert and upload these as huggingface models it would be great!

Many thanks!

Issue Analytics

State:
Created 3 years ago
Comments:15 (8 by maintainers)

Top GitHub Comments

1reaction

stas00commented, Nov 2, 2020

FSMT already does moses+bpe. No pre- or post-processing is required.

0reactions

lighteternalcommented, Nov 8, 2020

Closing this, as it’s solved by by @stas00 in #8374

Top Results From Across the Web

fairseq: A Fast, Extensible Toolkit for Sequence Modeling

toolkit that allows researchers and devel- opers to train custom models for translation, summarization, language modeling, and other text generation tasks. The ...

Command-line Tools — fairseq 0.12.2 documentation

Fairseq provides several command-line tools for training and evaluating models: fairseq-preprocess: Data pre-processing: build vocabularies and binarize ...

fairseq - PyPI

Fairseq (-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling ...

arXiv:2207.05851v4 [cs.CL] 2 Aug 2022

(NMT). Now based on PyTorch, Sockeye 3 provides faster model implementations and ... and Fairseq are fastest, training models with com-.

The Training Process of NMT through the Lens of Classical SMT

Our contributions are as follows: • we show that during training, NMT undergoes the following three stages: ◦ target-side language modeling;.