Old RA2 hparams for some ResNet models, trained during the transition from the SGD + RandAugment based 'RA' settings to the RMSProp based 'RA2' settings with Mixup added.
The yaml files are for a 2x GPU distributed setup, so when training on a different number of GPUs, adjust the batch size and LR accordingly to keep the global batch size / LR equivalent.
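
A minimal sketch of that adjustment, assuming the batch size in the yaml is per GPU and that the LR is rescaled in proportion to the global batch size (the common linear scaling heuristic; a square-root rule is sometimes preferred for adaptive optimizers such as RMSProp). The function names and all numbers below are hypothetical placeholders, not values taken from the yaml files.

```python
def global_batch_size(per_gpu_batch_size: int, num_gpus: int, grad_accum_steps: int = 1) -> int:
    """Effective number of samples contributing to one optimizer step."""
    return per_gpu_batch_size * num_gpus * grad_accum_steps


def scale_lr(ref_lr: float, ref_global_batch: int, new_global_batch: int, rule: str = "linear") -> float:
    """Rescale a reference LR for a different global batch size.

    rule="linear": LR scales with the batch-size ratio.
    rule="sqrt":   LR scales with the square root of the ratio.
    Which rule works best is an assumption to validate empirically.
    """
    ratio = new_global_batch / ref_global_batch
    if rule == "linear":
        return ref_lr * ratio
    if rule == "sqrt":
        return ref_lr * ratio ** 0.5
    raise ValueError(f"unknown scaling rule: {rule!r}")


# Hypothetical example: a config written for 2 GPUs, reused on 8 GPUs.
ref_batch = global_batch_size(per_gpu_batch_size=128, num_gpus=2)
new_batch = global_batch_size(per_gpu_batch_size=128, num_gpus=8)
new_lr = scale_lr(ref_lr=0.1, ref_global_batch=ref_batch, new_global_batch=new_batch)
print(ref_batch, new_batch, new_lr)
```

Alternatively, to keep the LR from the yaml unchanged, pick a per-GPU batch size so that the global batch size matches the original 2x GPU value.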