Averaging Weights Leads to Wider Optima and Better Generalization – PaperGrep https://papergrep.dev/paper/averaging-weights-leads-to-wider-optima-and-28dae7