EvidenceEmerging evidencev1.10.02026-06-26T00:00:00Z

Colluding LoRA: A Compositional Vulnerability in LLM Safety Alignment

Evidence card

Claim: A component can pass isolated inspection while the dangerous behavior exists in a composition state.
Evidence level: Emerging evidence
Source: https://arxiv.org/abs/2603.12681
Publication date: 2026-03-13
Authors or institution: Sihao Ding
System tested: Composed LoRA adapters that appear benign separately but degrade safety when combined in the studied setup.
Limitations: Preprint; scope, models, and exact compositional assumptions need independent replication.
What the evidence does show: A component can pass isolated inspection while the dangerous behavior exists in a composition state.
What the evidence does not show: That every composed adapter pair colludes or that the phenomenon is inevitable.
Date last reviewed in UTC: 2026-06-26T00:00:00Z

Site use

This source supports Cognivirus.com pages related to LoRA composition, composition-triggered vulnerability, safety alignment. Its role is bounded by the limitations listed above.