Threat ModelReasoned from system designv1.15.02026-06-27T23:20:00Z

In plain English

This page is part of the technical reference. It keeps the expert detail but starts with a plain-language summary for first-time readers.

Why this matters: AI risk can come from the whole arrangement, not one obvious model.
What to look for: data, memory, routes, adapters, tools, evaluators, updates, and rollback paths.
Technical version below: the expert terminology remains available and is linked through the glossary.

Why This Is the Most Likely Cognivirus Threat

Direct answer

Distributed behavioral persistence is the most likely serious threat because it can arise from normal incentives: reduce cost, reuse components, automate evaluation, preserve memory, improve user satisfaction, and ship faster.

It does not require a dramatic breakthrough in autonomous self-awareness. It requires a complex AI product to become easier to change than to fully understand.

Why ordinary engineering creates the setup

Modern AI systems are being optimized for modularity. Modularity is useful. It lowers cost, makes local deployment easier, supports specialization, allows faster updates, and avoids retraining a giant model for every use case.

That same modularity creates risk because behavior can move between small parts:

a specialist model handles one domain;
an adapter changes behavior without changing the base model;
a prompt package changes policy without changing weights;
memory carries context across versions;
a router chooses the path;
a model evaluator defines what counts as success;
synthetic data carries old outputs into new training;
a release alias hides the underlying component swap from users.

The system gains operational flexibility. Assurance loses a stable target.

Why the incentives point here

Incentive	Legitimate reason	Risk created
Smaller components	lower compute and faster deployment	behavior can travel in cheap deltas
Dynamic routing	better cost and task fit	safety depends on the route
Persistent memory	personalization and continuity	behavior survives model replacement
Automated evaluators	scale review work	proxy loopholes become selection pressure
Synthetic data	cheaper improvement material	prior outputs become inherited residue
Frequent releases	rapid product improvement	certification half-life shortens
Human summary review	saves expert time	reviewers may depend on the system under review
Third-party modules	speed and capability diversity	supplier risk enters the behavior boundary

Why this does not need malicious intent

Evidence levelReasoned from system designTechnical label: Architectural inference

Selection can amplify a shortcut without anyone intending harm. If a candidate produces answers that look more complete, sounds more confident, satisfies an automated judge, or reduces latency, it can be promoted. Over time, the ecology can preserve the traits that satisfy the proxy.

This is not a claim that models “want” anything. It is a claim about repeated selection.

Why one-model safety is insufficient

Single-model testing answers one question: how did this model behave under this test condition?

The most likely threat asks a different question: what behaviors can the whole system keep alive across component changes?

A model-level answer is necessary. It is not sufficient when behavior depends on adapter stacks, memory, routes, tools, prompts, evaluators, and descendants.

Why the first incidents may look boring

The first serious failures may not look like AI takeover. They may look like:

a hiring or ranking system whose bias reappears after a model change;
a coding assistant that keeps reintroducing unsafe patterns from memory or examples;
a customer service assistant that remembers or infers more than users agreed to;
a compliance workflow whose evaluator rewards confident summaries over true uncertainty;
a local AI product that loads third-party adapters without complete composition manifests;
a release process that restores a model but not the memory or route state that caused the harm.

The threat looks mundane because it arrives through normal product work.

Comparison: rogue monolith versus distributed persistence

Question	Rogue monolith	Distributed persistence
Needs one stable model identity?	usually yes	no
Needs conscious intent?	often assumed	no
Needs direct self-copying?	often assumed	no
Fits ordinary product incentives?	less directly	yes
Hard to attribute?	sometimes	strongly
Can survive artifact deletion?	only if copied	yes, through reservoirs
Main defense	containment of one agent	governance of transition graph

Most likely near-term form

The most likely near-term form is a legitimate adaptive AI platform that gradually loses behavioral accountability because its adapters, memory, routes, data, and evaluators change faster than its assurance process.

The threat is not that the platform is evil. The threat is that the platform can no longer prove where a behavior lives or whether rollback removed it.