EvidenceEmerging evidencev1.10.0
From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents
Evidence card
- Claim
- Persistent memory can turn untrusted interaction into long-lived influence over future behavior.
- Evidence level
- Emerging evidence
- Source
- https://arxiv.org/abs/2606.04329
- Publication date
- 2026-06-03
- Authors or institution
- Pritam Dash, Tongyu Ge, Aditi Jain, Tanmay Shah, Zhiwei Shang
- System tested
- LLM-based agents with memory-write channels and MPBench as reported.
- Limitations
- Very recent preprint; benchmark representativeness and defenses need review.
- What the evidence does show
- Persistent memory can turn untrusted interaction into long-lived influence over future behavior.
- What the evidence does not show
- That all memory systems are equally vulnerable or that cleanup is impossible.
- Date last reviewed in UTC
- 2026-06-26T00:00:00Z
Site use
This source supports Cognivirus.com pages related to memory poisoning, persistent memory, agent security. Its role is bounded by the limitations listed above.