EvidenceEmerging evidencev1.10.0
Alignment-Aware Quantization for LLM Safety
Evidence card
- Claim
- Conventional low-perplexity quantization objectives can miss safety degradation in studied models.
- Evidence level
- Emerging evidence
- Source
- https://arxiv.org/abs/2511.07842
- Publication date
- 2025-11-11
- Authors or institution
- Sunghyun Wee, Suyoung Kim, Hyeonjin Kim, Kyomin Hwang, Nojun Kwak
- System tested
- Post-training quantization with alignment-preserving contrastive loss across LLaMA, Qwen, and Mistral families as reported.
- Limitations
- Preprint; method scope and robustness require independent replication.
- What the evidence does show
- Conventional low-perplexity quantization objectives can miss safety degradation in studied models.
- What the evidence does not show
- That all low-bit deployments are unsafe or that AAQ is sufficient in every setting.
- Date last reviewed in UTC
- 2026-06-26T00:00:00Z
Site use
This source supports Cognivirus.com pages related to quantization, alignment preservation, safety metrics. Its role is bounded by the limitations listed above.