NVIDIA
NVIDIA
BertBaseUncasedForNemo
Model
NVIDIA
NVIDIA
BertBaseUncasedForNemo

BERT Base Model trained on uncased Wikipedia and BookCorpus dataset on a sequence length of 512.

1 Version
1Selected
03/26/2020 8:30 PM UTC511.75 MBAccuracy: 00 EpochsBatch Size: 0GPU: V100
Finetuning Results
KeyValue
GLUE MRPC ACCURACY86.52
GLUE MRPC F190.53
SQUADV1.1 EM82.74
SQUADV1.1 F189.79
SQUADV2.0 EM71.24
SQUADV2.0 F174.32
Pretraining Setup
KeyValue
AMP OPTIMIZATION LEVELO1
BATCH SIZE PER GPU8
LEARNING RATE0.4375E-4
NUMBER OF GPUS8
NUMBER OF ITERATIONS2285714

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.