AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration – PaperGrep https://papergrep.dev/paper/awq-activation-aware-weight-quantization-for-llm-5a9788