>>106333892
If you keep the learning rate fixed, just increasing the batch size will slow down training proportionally. Picrel is a short test, BS1 vs BS2, with the same LR.