NVIDIA Deep Learning Examples
BERT PaddlePaddle checkpoint (Large, Pretraining, AMP, LAMB)
Model

BERT Large PaddlePaddle checkpoint, pretrained with the LAMB optimizer using automatic mixed precision (AMP).

1 Version
11/23/2022 1:04 AM UTC
5.03 GB
Accuracy: 0
0 Epochs
Batch Size: 0
GPU: A100
architecture
Key     Value
type    Large
performance
Key              Value
training_loss    1.41
training
Key                         Value
global_batch_size_phase2    32768
global_batch_size_phase1    65536
iterations_phase1           7038
LR_phase2                   0.004
LR_phase1                   0.006
iterations_phase2           1563
training_precision          AMP
bs_phase2                   32
warmup_proportion_phase2    0.128
bs_phase1                   256
warmup_proportion_phase1    0.2843
iterations                  8601
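
The values in the training table above are related by simple arithmetic, which the following Python sketch works through. It rests on assumptions that this page does not state: that `bs_phase*` is the per-device batch size, that the global batch size equals the per-device batch size multiplied by the number of devices and gradient accumulation steps, and that each warmup proportion is a fraction of that phase's iterations. The names `PHASES` and `derive` are hypothetical and are not part of any NVIDIA or PaddlePaddle API.

```python
# Values copied from the training table. How they relate to each other
# is an assumption of this sketch, not something the page confirms.
PHASES = {
    "phase1": {"iterations": 7038, "global_batch_size": 65536, "bs": 256,
               "lr": 0.006, "warmup_proportion": 0.2843},
    "phase2": {"iterations": 1563, "global_batch_size": 32768, "bs": 32,
               "lr": 0.004, "warmup_proportion": 0.128},
}


def derive(phase):
    """Return quantities derived from one phase's hyperparameters."""
    # Assumption: global batch = bs * (devices * gradient accumulation steps).
    scale, remainder = divmod(phase["global_batch_size"], phase["bs"])
    if remainder:
        raise ValueError("global batch size is not a multiple of bs")
    return {
        # Combined factor of device count and accumulation steps.
        "batch_scale": scale,
        # Assumption: warmup is a fraction of this phase's iterations.
        "warmup_iterations": phase["iterations"] * phase["warmup_proportion"],
        # Training sequences consumed over the whole phase.
        "sequences_seen": phase["iterations"] * phase["global_batch_size"],
    }


total_iterations = sum(p["iterations"] for p in PHASES.values())

for name, phase in PHASES.items():
    print(name, derive(phase))
print("total iterations:", total_iterations)
```

Under these assumptions, the two phase iteration counts sum to the `iterations` value in the table (7038 + 1563 = 8601). The batch scale does not separate device count from accumulation steps, because the page gives neither.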