Created by: suchenzang
Merges MegatronTrainer-specific code into Trainer.
Prepares for unifying the model_parallel and non-model-parallel code paths by matching their enc/dec structure.