This model translates text from the source language, English (En), into the target language, Hindi (Hi).
The model is based on the Transformer "Big" architecture originally presented in the "Attention Is All You Need" paper. In this particular instance, the model has 12 layers in the encoder and 2 layers in the decoder. It uses the YouTokenToMe tokenizer.
This model was trained on a collection of many publicly available datasets comprising millions of parallel sentences. The NeMo toolkit was used to train the model for roughly 200k steps.
While training this model, we used the following datasets:
We used the YouTokenToMe tokenizer with separate BPE tokenizers for the encoder and the decoder.
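To illustrate what "separate encoder and decoder BPE tokenizers" means, here is a minimal toy sketch of byte-pair encoding in plain Python. It is not YouTokenToMe's implementation or API; the function names (`learn_bpe`, `apply_merge`, `encode`), the tiny word lists, and the merge counts are all hypothetical and chosen only for illustration. The point is that one set of merge rules is learned from source-side text and a second, independent set from target-side text.

```python
from collections import Counter


def apply_merge(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with the joined symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return out


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (toy BPE)."""
    # Each word starts as a sequence of single characters, weighted by frequency.
    corpus = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols, freq in corpus.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Most frequent pair; ties are broken by the pair itself for determinism.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        merged = Counter()
        for symbols, freq in corpus.items():
            merged[tuple(apply_merge(list(symbols), best))] += freq
        corpus = merged
    return merges


def encode(word, merges):
    """Split `word` into subword units by replaying the learned merges in order."""
    symbols = list(word)
    for pair in merges:
        symbols = apply_merge(symbols, pair)
    return symbols


# Hypothetical toy data: one tokenizer per language side.
src_merges = learn_bpe(["lower", "lowest", "newer", "newest"], num_merges=8)
tgt_merges = learn_bpe(["नमस्ते", "नमस्कार", "नमन"], num_merges=8)

src_tokens = encode("lowest", src_merges)
tgt_tokens = encode("नमस्ते", tgt_merges)
```

Because the two merge tables are learned independently, the source and target vocabularies do not have to share any subword units, which suits a language pair written in different scripts.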
The accuracy of translation models is often measured using BLEU scores.
On the WMT14 test set, this model achieves a BLEU score of 20.9, measured using the SacreBLEU package. The SacreBLEU signature and full output are:
BLEU+case.mixed+lang.en-hi+numrefs.1+smooth.exp+test.wmt14+tok.13a+version.1.5.1 = 20.9 55.1/28.5/15.7/8.9 (BP = 0.966 ratio = 0.967 hyp_len = 58865 ref_len = 60872)
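The numbers in the SacreBLEU output are related by the standard BLEU definition: the score is the brevity penalty (BP) times the geometric mean of the 1- to 4-gram precisions. The sketch below recomputes the headline figure from the reported components using only the Python standard library. It does not call SacreBLEU, and because the reported precisions are rounded to one decimal place, it can only be expected to agree to the reported precision.

```python
import math

# Values copied from the reported SacreBLEU output.
precisions = [55.1, 28.5, 15.7, 8.9]  # 1- to 4-gram precisions, in percent
hyp_len, ref_len = 58865, 60872       # total hypothesis / reference lengths


def brevity_penalty(hyp_len, ref_len):
    """Standard BLEU brevity penalty: penalizes hypotheses shorter than the reference."""
    if hyp_len >= ref_len:
        return 1.0
    return math.exp(1.0 - ref_len / hyp_len)


def bleu_from_precisions(precisions, bp):
    """BLEU = BP * geometric mean of the n-gram precisions."""
    log_mean = sum(math.log(p) for p in precisions) / len(precisions)
    return bp * math.exp(log_mean)


ratio = hyp_len / ref_len
bp = brevity_penalty(hyp_len, ref_len)
bleu = bleu_from_precisions(precisions, bp)

print(f"ratio={ratio:.3f} BP={bp:.3f} BLEU={bleu:.1f}")
# prints: ratio=0.967 BP=0.966 BLEU=20.9
```

A BP below 1 indicates that the model's translations are, in total, slightly shorter than the references, which lowers the final score by about 3%.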
The model is available for use in the NeMo toolkit and can be used as a pre-trained checkpoint for inference or for fine-tuning on another dataset.
import nemo
import nemo.collections.nlp as nemo_nlp
nmt_model = nemo_nlp.models.machine_translation.MTEncDecModel.from_pretrained(model_name="nmt_en_hi_transformer12x2")
python [NEMO_GIT_FOLDER]/examples/nlp/machine_translation/nmt_transformer_infer.py --model=nmt_en_hi_transformer12x2.nemo --srctext=[TEXT_IN_SRC_LANGUAGE] --tgtout=[WHERE_TO_SAVE_TRANSLATION] --target_lang hi --source_lang en
The translate method of the NMT model accepts a list of de-tokenized strings.
The translate method outputs a list of de-tokenized strings in the target language.
Vaswani, Ashish, et al. "Attention Is All You Need." arXiv preprint arXiv:1706.03762 (2017).