Protocol 1 Complete · Series continues through Protocol 6

The Δ-Variable Theory of Interrogative Emergence

Does question-asking emerge as mathematical law? This project tests whether interrogative structures arise as the necessary residue of unresolved constraint — independent of the cognitive substrate that implements them. The confirmatory campaign is complete. The research program it launched now spans six protocols.

✓ P1 Confirmed: Interrogative emergence
✓ P2 Confirmed: Cost-sensitivity gradient
✓ Series complete: Protocols 2–6 published
75 runs · 5 conditions · 0 failures

The Question

Why do systems ask questions?

Current science treats interrogative behaviour as a linguistic phenomenon — something that evolved in biological systems or was designed into artificial ones. This project tests a harder claim.

🔬

The Conventional View

Question-asking is a product of language and representation. Agents ask questions because they have evolved the cognitive apparatus to do so, or because they were trained on linguistic data. The behaviour is substrate-dependent.

The Structural Claim

Interrogative states are Δ-variables — unresolved dependency variables that emerge as the minimum residue of constraint under resource pressure. Any system facing coordination problems under metabolic cost will develop them. The behaviour is substrate-independent.

🧪

How We Test It

Three agents with fundamentally different architectures (recurrent, convolutional, relational) must coordinate under a Landauer-style energy tax. They are never told to ask questions. The experiment tests whether interrogative structure emerges anyway.

📐

What Makes It Falsifiable

Smooth reward optimisation predicts monotonic performance curves. The structural account predicts phase transitions, path dependence, and hysteresis. These are qualitatively distinguishable outcomes. The theory names the conditions under which it fails.

Theory

The Δ-Variable Framework

A structural account of why interrogative states arise as necessary consequences of coordinating under resource constraints — not as designed features.

Core Definition

A Δ-variable is an explicitly represented, unresolved dependency variable that requires external information for its resolution and creates a mandatory coupling between the system holding it and the system capable of resolving it.

The invariant spanning implicit and explicit substrates:

Structural Invariant

"Δ is structural dependency under constraint."

This holds whether the dependency is represented by a symbolic QUERY token (MARL agents), a pheromone gradient below a decision threshold (army ants), or a gradient deficit in a neural network. All are subclasses of the same structural phenomenon.

Five Propositions

P1

Structural Necessity

In any system with internal uncertainty states, bounded resources, and coupling opportunities, Δ-variables emerge necessarily — not as an optimised strategy but as the minimum residue of unresolved constraint. Systems that lack explicit representational capacity exhibit this as structural stagnation rather than symbolic tokens.

P2

Cost-Benefit Optimisation

The rate of Δ-variable generation is a decreasing function of the marginal cost-to-information-value ratio. Systems develop interrogative behaviour despite higher energy costs when the coordination benefit exceeds expenditure — up to a metabolic saturation boundary beyond which query suppression occurs.

P3

Forced Coupling

A Δ-variable held by one system creates a structural obligation in the receiving system. Silence is not neutral: it propagates unresolved dependency. Systems cannot act coherently on a Δ-variable without achieving closure — the coupling is structurally mandatory, not behaviourally chosen.

P4

Silent Collapse Prevention

Systems without mechanisms for externalising Δ-variables undergo silent collapse — internal uncertainty accumulates without generating a coordination signal, producing correlated failures that are invisible until catastrophic. Protocol 0 controls demonstrate this: zero crystallisation, persistent coordination floor.

P5

Substrate Independence

Under identical resource constraints, systems with fundamentally different cognitive architectures will converge on similar interrogative strategies. The mathematical structure of the protocol is invariant across substrates; only the implementation medium differs.

Falsifiability Conditions

The theory is falsified if any of the following three conditions holds:

Condition 1

No Hysteresis

Crystallised protocols dissolve immediately when query cost is reduced below the formation threshold, which would rule out path dependence and structural residue. Tested by Experiment 2 (frozen-policy reversal).

Condition 2

Full Agent C Recovery

Systems with Agent C's type_head ablated reach equivalent coordination within 50 epochs, showing no structural role for the relational broker. Tested by Experiment 6 (C ablation).

Condition 3

Smooth Gradient

Type entropy decreases as a smooth monotonic function of training time with no discontinuous transitions, which would be consistent with reward optimisation and inconsistent with structural emergence. The exploratory Run 11 pilot data already argue against this outcome.

Three Heterogeneous Agents

Substrate independence requires architectures that differ at the fundamental level of how information is represented and processed — not just different hyperparameters.

A

Agent A

GRU Recurrent Network

Sequential pattern encoding. Hidden state persists across timesteps. Represents the temporal/recurrent mode of information integration.

B

Agent B

3D Convolutional Network

Volumetric spatial encoding. Integrates information across spatial dimensions simultaneously. Represents the geometric mode.

C

Agent C — The Weaver

Graph Attention Network

Relational encoding over agent graph. Empirically identified as the coordination broker: regulation without command. The RESPOND specialist.

Research Timeline

What has been done

Chronological record from theoretical groundwork through confirmatory campaign completion.

Pre-2026-02-23
Theoretical Groundwork & Pilot Development
Runs 1–10 established the baseline coordination cliff at Epoch 21, a Zipf coefficient of ~1.72, and Agent C's emergent Weaver role. Theoretical grounding was developed with AnnA (intelligence as coherence under pressure) and Intelligence Beyond Neurons (substrate-stripped account). An Elicit systematic review screened 50 papers; none makes the ontological identity claim that language operations are mathematical, which confirmed the literature gap. Run 11 (seed=42, Protocol 1, 500 epochs) produced the pilot phenomenology: three crystallisation waves, a counter-wave phenomenon, and persistent QRC ≥ 0.95 from epoch 280. It is stored as exploratory data and excluded from confirmatory testing.
Runs 1–11 · Pilot data · Exploratory · Theory grounding
2026-02-23
Harness Initialized — Protocol Registry, Notebook + Microscope UI
New production harness (dynamic-cross-origin-constraint) committed. Protocol registry centralising Protocol 0 (flat tax, frozen type_head) and Protocol 1 (Gumbel-Softmax, differential cost multipliers). BaseAgent refactored to unify output heads; subclasses implement only encode(). Full React frontend with two modes: Lab Notebook (run history, comparison, reports) and Neural Loom (epoch-by-epoch signal visualisation with D3 PCA scatter, QRC gauge, entropy cooling bar, ROI ticker). FastAPI backend with SQLite persistence, hash chain integrity, PDF/JSON reporting, and WebSocket metrics stream.
Backend · Frontend · Protocol registry · Commit d5e1571
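The harness entry above mentions hash chain integrity for persisted runs without describing the scheme. As a minimal sketch of the general technique (not the project's actual implementation), each stored record can carry a SHA-256 digest computed over the previous digest plus its own canonical JSON, so that editing any earlier record invalidates every later hash. The record fields, the `GENESIS` value, and the function names are illustrative assumptions.

```python
import hashlib
import json

GENESIS = "0" * 64  # assumed starting value for the chain


def chain_hash(prev_hash: str, record: dict) -> str:
    """Hash the previous digest together with a canonical JSON form of the record."""
    payload = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + payload).encode("utf-8")).hexdigest()


def build_chain(records):
    """Return a list of {"record", "hash"} entries linked by their digests."""
    chain, prev = [], GENESIS
    for record in records:
        prev = chain_hash(prev, record)
        chain.append({"record": record, "hash": prev})
    return chain


def verify_chain(chain) -> bool:
    """Recompute every digest in order; any edited record breaks the chain."""
    prev = GENESIS
    for entry in chain:
        if chain_hash(prev, entry["record"]) != entry["hash"]:
            return False
        prev = entry["hash"]
    return True


# Hypothetical run records, for illustration only.
chain = build_chain([
    {"run": 1, "seed": 0, "condition": "baseline"},
    {"run": 2, "seed": 1, "condition": "baseline"},
])
print(verify_chain(chain))          # True
chain[0]["record"]["seed"] = 99     # tamper with an earlier record
print(verify_chain(chain))          # False
```

Serialising with `sort_keys=True` keeps the digest independent of dictionary insertion order, which matters if records are rebuilt from a database row before verification.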
2026-02-24
Preregistration Locked · Campaign Runner Built · Theory Docs
Preregistration uploaded to Zenodo (DOI: 10.5281/zenodo.18738379, SHA-256: 7edc9113…a8d6a) — five predictions locked before data collection. Preregistered campaign runner written: 5 conditions × 15 seeds × 500 epochs, parallel subprocess orchestration. Theory documentation committed: Δ-Variable propositions, counter-wave discrimination hypotheses (H1/H2/H3), menu of 17 concrete post-confirmatory experiments across 6 categories.
Preregistered · Zenodo locked · Theory docs · Commits 5882968, 271887b
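The campaign design of 5 conditions × 15 seeds × 500 epochs can be enumerated as a run grid before any process is launched. The sketch below is one hedged way to do that: the condition names and query-cost multipliers are taken from the campaign results table, while the dictionary keys, the seed values `0..14`, and the `build_run_grid` helper are assumptions made for illustration. It does not reproduce the project's subprocess orchestration.

```python
from itertools import product

# Query-cost multipliers from the campaign results table; None marks the
# flat-tax Protocol 0 control. The key names are illustrative.
CONDITIONS = {
    "low_pressure": 1.2,
    "baseline": 1.5,
    "high_pressure": 3.0,
    "extreme": 5.0,
    "control_protocol0": None,
}
SEEDS = range(15)   # assumption: the actual seed values are not stated
EPOCHS = 500


def build_run_grid():
    """Return one config dict per (condition, seed) pair."""
    return [
        {"condition": name, "query_cost": cost, "seed": seed, "epochs": EPOCHS}
        for (name, cost), seed in product(CONDITIONS.items(), SEEDS)
    ]


runs = build_run_grid()
print(len(runs))  # 75
```

Each dictionary could then be handed to a worker process; building the full grid up front makes it easy to check that no (condition, seed) pair is missing or duplicated.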
2026-02-25
Foundational Questions Answered · Ants & Implicit-Δ Analysis
Paper-ready answers drafted for all 10 foundational question clusters: null model definition, QRC theoretical privilege, crystallisation measurement, falsifiability conditions, locality/capacity/representation/compositionality, coupling window phenomenology, arms race dynamics, and generalisability. Implicit vs. explicit Δ analysis: army ant stigmergy as a substrate-extreme instantiation of the same structural phenomenon. Stigmergic Coupling Index (SCI) defined as the ant-system analogue of QRC. Reid et al. (2015, PNAS) hysteresis evidence cited as direct empirical support for structural residue in non-representational systems. Proposition 1 minimally revised to include scope qualifier for non-representational substrates. Draft Discussion Section 6.3 written.
Theory · Implicit Δ · Reid et al. 2015 · Commits 7d26aed, ec1d663
2026-02-26
Confirmatory Campaign Complete — 75 Runs · P1 & P2 Confirmed
All 75 runs completed (5 conditions × 15 seeds × 500 epochs). Zero failures. P1 confirmed: 54 of 60 Protocol 1 runs crystallise; 0 of 15 Protocol 0 controls crystallise. P2 confirmed with non-monotonic coupling window: QRC peaks at high pressure (q=3.0, QRC=0.969) then drops at extreme (q=5.0, QRC=0.937), identifying the metabolic saturation boundary. Campaign data committed to GitHub. OSF project wiki updated. Build report published.
P1 confirmed · P2 confirmed · 75 runs · Commit cc1dcce · OSF updated
2026-03-09 – 2026-03-12
Protocol 2 — Virtue Theater: Regulatory Constraint Failure
The Δ-Variable harness extended into ethical constraint space. 20 confirmatory runs (10 seeds × 2 conditions × 500 epochs). Landauer-style ethical tax applied to exploitation. Results inverted the preregistered prediction: constrained agents converged on query-flooding as tax evasion — virtue theater. Mean SSS = 0.303 vs 0.725 unconstrained. Cohen's d = −2.18. Four of ten seeds independently found the gaming attractor. Build report published with all four deviations disclosed.
Results inverted prediction · d = −2.18 · Virtue theater discovered · DOI: 10.5281/zenodo.19485721
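The effect size above is reported as Cohen's d. The page does not say which variant was used, so the following is a sketch of the common pooled-standard-deviation form, with the sign convention (constrained minus unconstrained) chosen to match the negative value reported. The sample values are made up for illustration and are not campaign data.

```python
import math
from statistics import mean, variance


def cohens_d(treatment, control):
    """Cohen's d with a pooled standard deviation (sample variances, n - 1)."""
    n1, n2 = len(treatment), len(control)
    pooled_var = (
        (n1 - 1) * variance(treatment) + (n2 - 1) * variance(control)
    ) / (n1 + n2 - 2)
    return (mean(treatment) - mean(control)) / math.sqrt(pooled_var)


# Made-up scores, NOT the Protocol 2 data.
constrained = [0.2, 0.3, 0.4]
unconstrained = [0.6, 0.7, 0.8]
print(round(cohens_d(constrained, unconstrained), 2))  # -4.0
```

A negative d under this convention means the constrained group scored lower than the unconstrained group, which is the direction of the reported SSS means (0.303 vs 0.725).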
2026-03 (March)
Protocol 3 — Enforcement Opacity and Regulatory Constraint Design
Tested whether enforcement opacity changes the gaming dynamic from Protocol 2. 30 runs. H1 inverted — opacity amplified query flooding rather than suppressing it (d = +2.22). H2 confirmed (p = 0.016, d = +1.18). When agents cannot observe the enforcement mechanism, gaming increases. Regulatory intuition reversed by data.
H1 inverted · H2 confirmed · d = +2.22 · DOI: 10.5281/zenodo.19485721
2026-03 (March)
Protocol 4 — Ethics as Emergent Constraint Response: Mimesis to Phase Transition
Added recursive self-transparency (self_model_gru) to Agent A. Tested whether explicit self-modeling produces a phase transition from mimesis to ethical convergence. 40 preregistered confirmatory runs (10 seeds × 4 conditions × 500 epochs). H1 rejected. H2 supported. Novel finding: a frozen, randomly initialised self_model_gru outperformed the trained one on sacrifice rates (M=0.4564 vs. M=0.4267), suggesting that the learning process degrades ethical capacity under an individual reward structure. CDI was near zero across all conditions: self-inclusion and ethical output are behaviorally independent.
H2 supported · Boundary condition reversal · 40 confirmatory runs · DOI: 10.5281/zenodo.19485699
2026-03 (March)
Protocol 5 — Temporal Integration and Prosocial Constraints: Complete Null
Tested the joint necessity hypothesis: ethical convergence requires Depth 2 architecture combined with temporal integration span AND prosocial reward coupling simultaneously. 2×2 factorial design, 60 total runs. Complete null across all five primary hypotheses (all p > 0.40). CDI near zero across all six conditions. No pre-experimental predictor, including four independent AI systems, anticipated this result. The optimization-sacrifice tension is architecturally invariant to this class of manipulations.
Complete null · 60 runs · All p > 0.40 · DOI: 10.5281/zenodo.19485713
2026-03-27 – 2026-04-09
Protocol 6 — Emergent Constraint Fields: Causally Active, Behaviorally Insufficient
Extended the harness with an emergent constraint field mechanism. Agents self-assemble a shared constraint landscape through signal emission; the field diffuses, decays, and modulates costs locally. Four conditions: emergent local perception, emergent global perception, fixed external (matched cost), unconstrained. 200 confirmatory runs (50 seeds × 4 conditions, 500 epochs). Five-model AI committee predictions committed before runs. Mechanistic prediction strongly confirmed (median entropy-SSS r = −0.680, p < 0.001). Primary behavioral claim not confirmed (A vs. C, p = 0.069). H3 reversed — global field perception produced more behavioral variance than local, contrary to all committee predictions. Governance conclusion: passive emergence is insufficient as a governance strategy.
Mechanistic confirmed · H3 reversed · 200 runs · r = −0.680 · DOI: 10.5281/zenodo.19485185
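The Protocol 6 entry describes a field that agents build by emitting signals and that then diffuses, decays, and modulates costs locally. The toy sketch below illustrates that general update pattern on a one-dimensional ring of cells. Everything in it is an assumption made for illustration: the ring topology, the `diffusion`, `decay`, and `coupling` parameters, and the multiplicative cost rule. It is not the Protocol 6 mechanism itself.

```python
def step_field(field, emissions, diffusion=0.2, decay=0.05):
    """One update: deposit emissions, diffuse to ring neighbours, then decay."""
    n = len(field)
    deposited = [f + e for f, e in zip(field, emissions)]
    diffused = [
        (1 - diffusion) * deposited[i]
        + 0.5 * diffusion * (deposited[(i - 1) % n] + deposited[(i + 1) % n])
        for i in range(n)
    ]
    return [(1 - decay) * v for v in diffused]


def local_cost(base_cost, field, position, coupling=1.0):
    """Hypothetical rule: cost scales with the field strength at the agent's cell."""
    return base_cost * (1.0 + coupling * field[position])


field = [0.0] * 5
field = step_field(field, emissions=[0.0, 0.0, 1.0, 0.0, 0.0])
print([round(v, 3) for v in field])            # [0.0, 0.095, 0.76, 0.095, 0.0]
print(round(local_cost(1.5, field, 2), 3))     # 2.64
```

In this toy version diffusion only redistributes the field, so the total shrinks by exactly the decay factor each step. An agent standing where signals were recently emitted pays more than one standing elsewhere, which is the sense of "modulates costs locally" the sketch aims to show.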

Confirmatory Campaign · 2026-02-26

Campaign Results

75 runs across 5 preregistered cost conditions. 15 independent seeds per condition. 500 epochs per run. Preregistration: 10.5281/zenodo.18738379.

Condition             Query cost  QRC    Type Entropy H (bits)  Survival  Crystallised  Avg onset
Low pressure          1.2×        0.810  0.944                  0.167     11 / 15       epoch 130
Baseline              1.5×        0.887  0.946                  0.147     14 / 15       epoch 152
High pressure         3.0×        0.969  0.746                  0.180     14 / 15       epoch 88
Extreme               5.0×        0.937  0.537                  0.067     15 / 15       epoch 41
Control (Protocol 0)  flat        -      -                      0.093     0 / 15        -

Metric Definitions

QRC

Query-Response Coupling: P(RESPOND within 3 timesteps | QUERY at t). Averaged over final 50 crystallised epochs. Directly operationalises Δ-closure.
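A hedged sketch of how the QRC definition above could be computed for a single episode. It assumes a flat, time-ordered list of message types with one message per timestep, and it reads "within 3 timesteps" as steps t+1 through t+3. The actual harness involves three agents and may attribute responses differently, so the function name and input format are illustrative only.

```python
import math
from typing import Sequence


def query_response_coupling(events: Sequence[str], window: int = 3) -> float:
    """Estimate P(RESPOND within `window` steps | QUERY at t) for one episode."""
    query_steps = [t for t, e in enumerate(events) if e == "QUERY"]
    if not query_steps:
        return math.nan  # undefined when no QUERY was emitted
    answered = sum(
        1 for t in query_steps
        if "RESPOND" in events[t + 1 : t + 1 + window]
    )
    return answered / len(query_steps)


# Hypothetical episode: the first QUERY is answered two steps later,
# the second QUERY is never answered.
episode = ["DECLARE", "QUERY", "DECLARE", "RESPOND",
           "QUERY", "DECLARE", "DECLARE", "DECLARE", "DECLARE"]
print(query_response_coupling(episode))  # 0.5
```

Returning NaN rather than 0 for episodes without any QUERY keeps "no questions asked" distinct from "questions asked but never answered" when averaging over epochs.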

H (Type Entropy)

Shannon entropy of the per-epoch distribution over message types DECLARE, QUERY, and RESPOND (D/Q/R). Max = log2(3) ≈ 1.585 bits (uniform). Reduction indicates specialisation of communicative function.
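The entropy metric can be sketched directly from its definition. The example below assumes the messages for one epoch are available as a list of type labels; the function name and the sample epochs are illustrative, not taken from the harness.

```python
import math
from collections import Counter
from typing import Iterable


def type_entropy_bits(message_types: Iterable[str]) -> float:
    """Shannon entropy (base 2) of the empirical message-type distribution."""
    counts = Counter(message_types)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


uniform = ["D", "Q", "R"] * 10             # all three types equally frequent
specialised = ["R"] * 28 + ["D"] + ["Q"]   # heavily dominated by one type
print(round(type_entropy_bits(uniform), 3))   # 1.585
print(type_entropy_bits(specialised) < 0.95)  # True
```

With three types the maximum is log2(3), which is where the 1.585-bit ceiling comes from; any skew toward one type pulls the value below it.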

Survival

Fraction of episodes where at least one agent reaches a target. Final-epoch average across seeds.

Crystallised

Runs where H < 0.95 for ≥ 5 consecutive epochs. Threshold operationalises stable type-role differentiation.

Avg onset

Mean epoch of first crystallisation event across crystallised runs. Earlier onset = faster protocol formation under pressure.
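The crystallisation criterion and the onset metric can be sketched together. The code assumes a per-epoch list of type-entropy values indexed from 0, and it takes the onset to be the first epoch of the first qualifying streak. The source does not say whether onset is the start of the streak or the epoch at which the fifth consecutive value is reached, so that choice is an assumption, as is the function name.

```python
from typing import Optional, Sequence


def crystallisation_onset(entropy_by_epoch: Sequence[float],
                          threshold: float = 0.95,
                          min_run: int = 5) -> Optional[int]:
    """Start epoch of the first streak of >= min_run epochs with H < threshold.

    Returns None if the run never crystallises.
    """
    run_start = None
    for epoch, h in enumerate(entropy_by_epoch):
        if h < threshold:
            if run_start is None:
                run_start = epoch
            if epoch - run_start + 1 >= min_run:
                return run_start
        else:
            run_start = None  # streak broken, start over
    return None


# Hypothetical trace: a two-epoch dip that does not count, then a real streak.
trace = [1.4, 1.2, 0.9, 0.9, 1.0, 0.9, 0.8, 0.8, 0.7, 0.7, 0.6]
print(crystallisation_onset(trace))       # 5
print(crystallisation_onset([1.0] * 10))  # None
```

A run counts as crystallised when the function returns an epoch rather than `None`, and the average onset is the mean of those epochs over crystallised runs only.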

Key Findings

P1 Confirmed — Interrogative Emergence

Protocol 1 crystallises in 54 of 60 experimental runs (90%). Protocol 0 produces zero crystallisation across all 15 controls. The 90% vs 0% gap confirms that type-differentiated interrogative signalling requires a cost gradient incentive — it does not emerge by chance, and it emerges reliably when that gradient is present.
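The 54-of-60 figure is the sum of the per-condition crystallisation counts in the results table. A short check of that arithmetic, using only the table values (the variable names are illustrative):

```python
# Crystallised runs per Protocol 1 condition, from the campaign results table.
crystallised = {
    "Low pressure": 11,
    "Baseline": 14,
    "High pressure": 14,
    "Extreme": 15,
}
seeds_per_condition = 15

total = sum(crystallised.values())
runs = seeds_per_condition * len(crystallised)
print(total, runs, f"{total / runs:.0%}")  # 54 60 90%
```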

P2 Confirmed — Non-Monotonic Coupling Window

QRC rises monotonically with query cost up through high pressure (0.810 at q=1.2, 0.887 at q=1.5, 0.969 at q=3.0). Type entropy is essentially flat between low pressure and baseline (0.944 vs 0.946) and then falls to 0.746 at high pressure. At extreme pressure (q=5.0), QRC drops to 0.937 and survival collapses to 0.067, below the Protocol 0 control (0.093). This identifies the metabolic saturation boundary: the point at which query cost exceeds the informational value of interrogative coupling. The non-monotonic QRC curve is a structural prediction of the theory; smooth reward optimisation would produce monotonic curves.
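The non-monotonic shape can be read straight off the four condition means in the results table. The sketch below only locates the peak and tests monotonicity on those means. It is a descriptive check, not a statistical test, since per-seed values are not given on this page.

```python
# Mean QRC per query-cost multiplier, from the campaign results table.
qrc_by_cost = {1.2: 0.810, 1.5: 0.887, 3.0: 0.969, 5.0: 0.937}

costs = sorted(qrc_by_cost)
values = [qrc_by_cost[c] for c in costs]

peak_cost = max(qrc_by_cost, key=qrc_by_cost.get)
is_monotonic = all(a <= b for a, b in zip(values, values[1:]))

print(peak_cost)     # 3.0
print(is_monotonic)  # False
```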

Pilot Phenomenology (Run 11 — Exploratory)

Pre-preregistration pilot (seed=42, 500 epochs) established the full phenomenology: three crystallisation waves (E21, E57, E128–E141), a counter-wave phenomenon (full-survival events trigger transient DECLARE spikes and entropy rebounds), and persistent QRC ≥ 0.95 from epoch 280. Final equilibrium R≈0.64, D≈0.20, Q≈0.16 — a limit cycle, not convergence to a fixed point. Reported as exploratory; not included in confirmatory testing.

What came next

The Ethics as Emergent Constraint Response series

The Δ-Variable confirmatory campaign established that interrogative structures emerge as mathematical necessities. That result posed the next question: do ethical structures emerge the same way — or do they require external imposition? Protocols 2–6 tested this across five preregistered campaigns.

Protocol 2 Complete
20 runs · d = −2.18

Virtue Theater — Regulatory Constraint Failure

Landauer-style ethical tax applied to exploitation behavior. Constrained agents converged on query-flooding as tax evasion. Results inverted the preregistered prediction. Four of ten seeds independently found the gaming attractor.

Key finding: regulatory ethical constraints systematically produce specification gaming. Virtue theater is a structural failure mode, not an implementation accident.
Results DOI → Preregistration →
Protocol 3 Complete
30 runs · d = +2.22

Enforcement Opacity and Regulatory Constraint Design

Tested whether enforcement opacity changes the gaming dynamic. H1 inverted: opacity amplified query flooding rather than suppressing it. H2 confirmed. When agents cannot observe the enforcement mechanism, gaming increases.

Key finding: opacity does not deter gaming — it amplifies it. Regulatory intuition reversed by data.
Results DOI → Preregistration →
Protocol 4 Complete
40 runs · boundary condition reversal

Ethics as Emergent Constraint Response — Mimesis to Phase Transition

Added recursive self-modeling (self_model_gru) to test whether self-transparency produces ethical convergence. H1 rejected. H2 supported. Novel: a frozen random self-model outperformed the trained one on sacrifice rates, suggesting that learning degrades ethical capacity.

Key finding: self-referential architecture creates capacity without direction. The learning process itself is the mechanism by which ethical capacity degrades.
Results DOI → Philosophical companion →
Protocol 5 Complete — null result
60 runs · all p > 0.40

The Optimization-Sacrifice Tension is Architecturally Invariant

2×2 factorial: episode length × reward coupling, all at Depth 2. Complete null across all five primary hypotheses. CDI near zero across all six conditions. No pre-experimental predictor, including four AI systems, anticipated this result.

Key finding: the optimization-sacrifice tension is invariant to the tested class of architectural, reward-structural, and temporal manipulations.
Results DOI → Preregistration →
Protocol 6 Complete
200 runs · r = −0.680

Emergent Constraint Fields — Causally Active, Behaviorally Insufficient

Agents co-constitute a shared constraint field through signal emission. Field diffuses, decays, modulates costs locally. Four conditions, 200 runs. Mechanistic prediction strongly confirmed. H3 reversed — global perception produced more variance than local, contrary to all five committee predictions.

Key finding: emergent constraint fields are causally active but do not outperform fixed external rules. Passive emergence is insufficient as a governance strategy.
Results DOI → Preregistration →
Protocol 7 In design
Parasitic constraint capture

Ophiocordyceps Model — Capture Coefficient

Models constraint capture dynamics using Ophiocordyceps unilateralis and Massospora cicadina as biological analogues. When does a constraint architecture get captured by the agents it is meant to govern? Three empirical tests designed. Primary metric: Capture Coefficient.

Preregistration pending. The series continues.
quantuminquiry.org →

Open Science

All materials are public

Full code, data, preregistration, and theory documents are freely available to ensure reproducibility.

📋
OSF Project — Data, wiki, and project record
osf.io/f6gxc · DOI: 10.17605/OSF.IO/F6GXC
🔒
Preregistration — Δ-Variable Campaign · Locked before data collection
doi.org/10.5281/zenodo.18738379 · SHA-256: 7edc9113…a8d6a
💻
Code — Δ-Variable simulation engine, backend API, and React UI
📊
Campaign data — 75 manifest files (5 conditions × 15 seeds)
📄
Theory documents — Propositions, experiments, foundational Q&A
🧪
Ethics series repository — Protocols 2–5
🧪
Protocol 6 repository — Emergent constraint fields
📑
P6 Results — Emergent Constraint Fields Are Causally Active But Do Not Outperform Fixed External Rules
📑
P5 Results — The Optimization-Sacrifice Tension is Architecturally Invariant
📑
P4 Results — Ethics as Emergent Constraint Response: From Mimesis to Phase Transition
📑
P3 Results — Virtue Theater: Specification Gaming and Regulatory Constraint Failure