Created by: suchenzang
This PR removes *normalize_before flags and defaults to pre-norm logic throughout.
Part of https://github.com/facebookresearch/metaseq/issues/368 efforts.
Created by: suchenzang
This PR removes *normalize_before flags and defaults to pre-norm logic throughout.
Part of https://github.com/facebookresearch/metaseq/issues/368 efforts.