Model
GNMT TensorFlow checkpoint trained with AMP
Use the NGC CLI to download:
Copied!
Model Overview
The GNMT v2 model is an improved version of the first Google's Neural Machine Translation System with a modified attention mechanism.
Model Architecture
The following image shows the GNMT model architecture:

Training
This model was trained using script available on NGC and in GitHub repo.
Dataset
The following datasets were used to train this model:
- WMT16 English-German - Dataset for machine translation.
Performance
Performance numbers for this model are available in NGC.
References
License
This model was trained using open-source software available in Deep Learning Examples repository. For terms of use, please refer to the license of the script and the datasets the model was derived from.