Contributing trained Greek<->English NMT models implemented with fairseq
See original GitHub issueHi there, quick question that I couldn’t answer by searching the docs:
I trained an EL-EN (Greek to English) and an EN-EL machine translation model using the fairseq implementation of the transformer_iwslt_de_en
architecture on ~6GB of parallel corpora. Given that the models report a better BLEU score compared to the existing SotA, I would like to share them somehow. I thought that fairseq might offer a huggingface-like way to upload trained models but I couldn’t find any, so I would appreciate any guidance.
If there’s a straightforward way to convert and upload these as huggingface models it would be great!
Many thanks!
Issue Analytics
- State:
- Created 3 years ago
- Comments:15 (8 by maintainers)
Top Results From Across the Web
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
toolkit that allows researchers and devel- opers to train custom models for translation, summarization, language modeling, and other text generation tasks. The ...
Read more >Command-line Tools — fairseq 0.12.2 documentation
Fairseq provides several command-line tools for training and evaluating models: fairseq-preprocess: Data pre-processing: build vocabularies and binarize ...
Read more >fairseq - PyPI
Fairseq (-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling ...
Read more >arXiv:2207.05851v4 [cs.CL] 2 Aug 2022
(NMT). Now based on PyTorch, Sockeye 3 provides faster model implementations and ... and Fairseq are fastest, training models with com-.
Read more >The Training Process of NMT through the Lens of Classical SMT
Our contributions are as follows: • we show that during training, NMT undergoes the following three stages: ◦ target-side language modeling;.
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
FSMT already does moses+bpe. No pre- or post-processing is required.
Closing this, as it’s solved by by @stas00 in #8374