SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models – PaperGrep https://papergrep.dev/paper/smoothquant-accurate-and-efficient-post-training-cb2dca