Riva Megatron NMT any2any

Machine translation model for any-to-any translation directions


Machine Translation: Multilingual 1.6B Any-Any NMT Model - Model Overview

Description:

The Megatron Multilingual 1.6B Neural Machine Translation model translates text in any-to-any directions across the 37 supported languages, including non-English-centric translation (such as French to Chinese). The supported languages are: English (en), Czech (cs), Danish (da), German (de), Greek (el), European Spanish (es-ES), Latin American Spanish (es-US), Finnish (fi), French (fr), Hungarian (hu), Italian (it), Lithuanian (lt), Latvian (lv), Dutch (nl), Norwegian (no), Polish (pl), European Portuguese (pt-PT), Brazilian Portuguese (pt-BR), Romanian (ro), Russian (ru), Slovak (sk), Swedish (sv), Simplified Chinese (zh-CN), Traditional Chinese (zh-TW), Japanese (ja), Hindi (hi), Korean (ko), Estonian (et), Slovenian (sl), Bulgarian (bg), Ukrainian (uk), Croatian (hr), Arabic (ar), Vietnamese (vi), Turkish (tr), Indonesian (id), Thai (th). This model is ready for commercial use.
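As a rough illustration of what "any to any" covers, the sketch below lists the 37 language codes from the description and counts the ordered translation pairs. The helper name `is_supported_pair` is hypothetical and not part of Riva, and whether the service expects exactly these code spellings is an assumption. The count includes every ordered pair of distinct codes, so treat it as an upper bound.

```python
# Language codes copied from the description above.
SUPPORTED_LANGUAGES = (
    "en", "cs", "da", "de", "el", "es-ES", "es-US", "fi", "fr", "hu",
    "it", "lt", "lv", "nl", "no", "pl", "pt-PT", "pt-BR", "ro", "ru",
    "sk", "sv", "zh-CN", "zh-TW", "ja", "hi", "ko", "et", "sl", "bg",
    "uk", "hr", "ar", "vi", "tr", "id", "th",
)


def is_supported_pair(source: str, target: str) -> bool:
    """Hypothetical helper: True if both codes are listed and differ."""
    return (
        source in SUPPORTED_LANGUAGES
        and target in SUPPORTED_LANGUAGES
        and source != target
    )


# Every ordered (source, target) pair of distinct languages.
NUM_DIRECTIONS = len(SUPPORTED_LANGUAGES) * (len(SUPPORTED_LANGUAGES) - 1)
```

With 37 languages this gives 37 × 36 = 1332 ordered pairs, of which only 72 involve English on one side.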

Model Architecture

Architecture Type: Transformer

Network Architecture: Megatron

The model is based on the Transformer architecture originally presented in the "Attention Is All You Need" paper [1]. This instance has 24 encoder layers and 24 decoder layers, and it uses a SentencePiece tokenizer [2].

Input:

Input Type(s): Text String
Input Format(s): List

Other Properties Related to Input: No Pre-Processing Needed; No Tokenization required; 1024 Character Text String Limit (No non-textual characters)
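Because of the 1024-character limit, longer text has to be split before it is submitted. The following is a minimal sketch, assuming the limit applies to each string in the input list and that "characters" means Python string characters (code points). The function name `split_for_translation` is hypothetical and not part of Riva.

```python
MAX_CHARS = 1024  # per-string limit stated above


def split_for_translation(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Hypothetical helper: split text into chunks of at most `limit` characters.

    Breaks at whitespace where possible (runs of whitespace collapse to a
    single space) and falls back to a hard cut for any single word longer
    than the limit.
    """
    chunks: list[str] = []
    current = ""
    for word in text.split():
        # Hard-cut a word that cannot fit in a chunk on its own.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```

Whitespace splitting is a simplification. For languages written without spaces (for example Chinese, Japanese, or Thai) this sketch degrades to hard cuts, so splitting at sentence boundaries would likely preserve more context for the translation.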

Output:

Output Type(s): Text String
Output Format: List
Output Parameters: Selected Language
Other Properties Related to Output: Outputs are not tokenized or processed to hide sensitive input information

References:

[1] Vaswani, Ashish, et al. "Attention is all you need." arXiv preprint arXiv:1706.03762 (2017).
[2] https://github.com/google/sentencepiece
[3] https://en.wikipedia.org/wiki/BLEU
[4] https://github.com/mjpost/sacreBLEU
[5] NVIDIA NeMo Toolkit

Software Integration:

Runtime Engine(s): Riva 2.18.0

Supported Hardware Platform(s):

  • NVIDIA Ampere
  • NVIDIA Hopper
  • NVIDIA Jetson
  • NVIDIA Lovelace
  • NVIDIA Turing
  • NVIDIA Volta

Supported Operating System(s):

  • Linux
  • Linux 4 Tegra

Model Version(s):

rmir_nmt_megatron_1b_any_any:2.18.0

Training & Evaluation Dataset:

Data Collection Method by dataset:

  • Human

Labeling Method by dataset:

  • Automated

Performance of the models

Performance of the model in any -> any directions on the Flores-101 dataset:

| Languages | de    | es-es | es-us | fr    | ja    | ru    | zh-cn |
|-----------|-------|-------|-------|-------|-------|-------|-------|
| de        | -     | 24.50 | 24.10 | 39.30 | 27.30 | 26.10 | 33.30 |
| es-es     | 22.10 | -     | -     | 30.30 | 23.50 | 20.20 | 29.80 |
| es-us     | 22.10 | -     | -     | 30.30 | 23.50 | 20.20 | 29.80 |
| fr        | 25    | 24.80 | 30.40 | -     | 26.60 | 25.50 | 32.70 |
| ja        | 16.90 | 16.40 | 18.10 | 23.70 | -     | 15.20 | 28.90 |
| ru        | 22.40 | 21.90 | 26.40 | 33.40 | 25.40 | -     | 30.90 |
| zh-cn     | 17.50 | 17.30 | 19.10 | 25.60 | 16.80 | 23.70 | -     |
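To work with these any -> any numbers programmatically, the sketch below stores the table row by row and provides two small summaries. The names `COLUMNS`, `SCORES`, `row_mean`, and `best_cell` are illustrative only. The table does not say whether rows are source or target languages, so the code refers to rows and columns rather than assuming a direction.

```python
from statistics import mean

COLUMNS = ["de", "es-es", "es-us", "fr", "ja", "ru", "zh-cn"]

# Scores copied from the table above; None marks a "-" cell.
SCORES = {
    "de":    [None, 24.50, 24.10, 39.30, 27.30, 26.10, 33.30],
    "es-es": [22.10, None, None, 30.30, 23.50, 20.20, 29.80],
    "es-us": [22.10, None, None, 30.30, 23.50, 20.20, 29.80],
    "fr":    [25.00, 24.80, 30.40, None, 26.60, 25.50, 32.70],
    "ja":    [16.90, 16.40, 18.10, 23.70, None, 15.20, 28.90],
    "ru":    [22.40, 21.90, 26.40, 33.40, 25.40, None, 30.90],
    "zh-cn": [17.50, 17.30, 19.10, 25.60, 16.80, 23.70, None],
}


def row_mean(row_label: str) -> float:
    """Average of the reported scores in one row, ignoring blank cells."""
    return mean(s for s in SCORES[row_label] if s is not None)


def best_cell() -> tuple[str, str, float]:
    """Return (row, column, score) for the highest score in the table."""
    return max(
        (
            (row, col, score)
            for row, values in SCORES.items()
            for col, score in zip(COLUMNS, values)
            if score is not None
        ),
        key=lambda cell: cell[2],
    )
```

Keeping blank cells as `None` rather than 0 keeps them from skewing the averages.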

Performance in the en -> any and any -> en directions on the Flores-101 dataset:

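| Language               | Eng -> Language | Language -> Eng |
|------------------------|-----------------|-----------------|
| Czech                  | 32.90           | 41.10           |
| Danish                 | 46.20           | 49.60           |
| German                 | 38.20           | 45.20           |
| Greek                  | 27.50           | 36.50           |
| European Spanish       | 27.60           | 30.70           |
| Latin American Spanish | 26.80           | 30.70           |
| Finnish                | 22.70           | 35              |
| French                 | 50.50           | 46.50           |
| Hungarian              | 26.70           | 36.90           |
| Italian                | 29.90           | 34.50           |
| Lithuanian             | 27.50           | 35.10           |
| Latvian                | 31.00           | 37.00           |
| Dutch                  | 26.70           | 32.60           |
| Norwegian              | 34.00           | 44.80           |
| Polish                 | 20.80           | 30.30           |
| European Portuguese    | 48.10           | 50.50           |
| Brazilian Portuguese   | 49.80           | 50.50           |
| Romanian               | 40.70           | 45.00           |
| Russian                | 31.30           | 36.10           |
| Slovak                 | 35              | 40.60           |
| Swedish                | 45.00           | 49.60           |
| Simplified Chinese     | 39.50           | 28.50           |
| Traditional Chinese    | 30.80           | 26.80           |
| Japanese               | 32.50           | 26.70           |
| Hindi                  | 33.50           | 39.90           |
| Korean                 | 28.00           | 29.50           |
| Estonian               | 27.30           | 38.90           |
| Slovenian              | 30.70           | 36.20           |
| Bulgarian              | 41.80           | 42.10           |
| Ukrainian              | 30.70           | 40.20           |
| Croatian               | 27.90           | 37.80           |
| Arabic                 | 28              | 40.60           |
| Vietnamese             | 41.80           | 36.90           |
| Turkish                | 29.50           | 38.80           |
| Indonesian             | 47.20           | 44.90           |
| Thai                   | 30.90           | 28.10           |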
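One way to read the English-centric table is to ask where translating out of English scores higher than translating into English. The sketch below copies the reported values and lists those languages. The names `EN_SCORES` and `stronger_out_of_english` are illustrative. Scores for different target languages are not strictly comparable, so this is a descriptive summary rather than a quality ranking.

```python
# (Eng -> Language, Language -> Eng) scores copied from the table above.
EN_SCORES = {
    "Czech": (32.90, 41.10), "Danish": (46.20, 49.60),
    "German": (38.20, 45.20), "Greek": (27.50, 36.50),
    "European Spanish": (27.60, 30.70),
    "Latin American Spanish": (26.80, 30.70),
    "Finnish": (22.70, 35.00), "French": (50.50, 46.50),
    "Hungarian": (26.70, 36.90), "Italian": (29.90, 34.50),
    "Lithuanian": (27.50, 35.10), "Latvian": (31.00, 37.00),
    "Dutch": (26.70, 32.60), "Norwegian": (34.00, 44.80),
    "Polish": (20.80, 30.30), "European Portuguese": (48.10, 50.50),
    "Brazilian Portuguese": (49.80, 50.50), "Romanian": (40.70, 45.00),
    "Russian": (31.30, 36.10), "Slovak": (35.00, 40.60),
    "Swedish": (45.00, 49.60), "Simplified Chinese": (39.50, 28.50),
    "Traditional Chinese": (30.80, 26.80), "Japanese": (32.50, 26.70),
    "Hindi": (33.50, 39.90), "Korean": (28.00, 29.50),
    "Estonian": (27.30, 38.90), "Slovenian": (30.70, 36.20),
    "Bulgarian": (41.80, 42.10), "Ukrainian": (30.70, 40.20),
    "Croatian": (27.90, 37.80), "Arabic": (28.00, 40.60),
    "Vietnamese": (41.80, 36.90), "Turkish": (29.50, 38.80),
    "Indonesian": (47.20, 44.90), "Thai": (30.90, 28.10),
}


def stronger_out_of_english() -> list[str]:
    """Languages whose Eng -> Language score exceeds Language -> Eng."""
    return [
        language
        for language, (from_en, to_en) in EN_SCORES.items()
        if from_en > to_en
    ]
```

For the values above, the function returns seven languages: French, Simplified Chinese, Traditional Chinese, Japanese, Vietnamese, Indonesian, and Thai.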

Inference:

Engine: Triton

Test Hardware:

  • NVIDIA Volta V100
  • NVIDIA Turing T4
  • NVIDIA A100 GPU
  • NVIDIA A30 GPU
  • NVIDIA A10 GPU
  • NVIDIA H100 GPU
  • NVIDIA L4 GPU
  • NVIDIA L40 GPU
  • NVIDIA Jetson Orin
  • NVIDIA Jetson AGX Xavier
  • NVIDIA Jetson NX Xavier

Ethical Considerations:

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their supporting model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards. Please report security vulnerabilities or NVIDIA AI Concerns here.

Publisher: NVIDIA
Latest Version: 2.18.0
Updated: March 7, 2025 UTC
Compressed Size: 2.96 GB
