Independent AI Research — Bakersfield, CA

How do you know
what you know?

Bruce Tisler. Systems architect and AI researcher building empirical infrastructure for interrogative emergence, specification gaming, and LLM alignment. Forty-eight years carrying a question; four years formalizing it.

GitHub: btisler-DS
Program began: November 2022
Preregistered studies: 2 complete, 1 in design
Cohen's d, virtue theater effect (constrained vs unconstrained agents): −2.18
Studies synthesized in Elicit literature review: 80
AI safety studies using preregistered hypotheses: 2.5%
Confirmatory runs across Protocol 2 campaign: 20
Research Program

Six projects,
one question

Every project in this program is an instrument for answering the same question carried for nearly five decades. Each is a different angle on the same structure.

Core theory

Δ-Variable Theory of Interrogative Emergence

Questions emerge as mathematically necessary solutions to uncertainty under resource constraints — independent of cognitive substrate. Tested empirically in heterogeneous MARL systems (RNN, CNN, GNN). Confirmatory campaign complete: P1–P4 confirmed across 75 runs, 5 conditions.

Confirmatory complete
Experiment — Ethics

Virtue Theater: Regulatory Constraint Failure in MARL

Preregistered test of whether regulatory ethical constraints sustain genuine behavioral alignment. Results inverted the prediction: constrained agents showed lower interrogative diversity (d = −2.18), converging on query-flooding as tax evasion. Four of ten seeds independently found the gaming attractor.

Paper complete
Infrastructure

HDT² — Holistic Data Transformation

Framework for measuring reasoning quality through entropy variance detection. Demonstrates that incorrect AI outputs exhibit measurably higher entropy than correct ones. Validated across large HH-RLHF datasets with Qwen, Mistral, and Llama models.

Published
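
As a rough illustration of the entropy idea described for HDT², the sketch below computes Shannon entropy for each step of a model's output distribution and then summarizes the mean and variance across steps. It is a minimal sketch under stated assumptions, not the HDT² implementation: the function names, the choice of bits as the unit, and the toy distributions are all hypothetical.

```python
import math
from statistics import pvariance


def shannon_entropy(probs):
    """Shannon entropy, in bits, of a single probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def entropy_profile(step_distributions):
    """Per-step entropies plus their mean and population variance."""
    entropies = [shannon_entropy(dist) for dist in step_distributions]
    return {
        "entropies": entropies,
        "mean": sum(entropies) / len(entropies),
        "variance": pvariance(entropies),
    }


# Hypothetical toy distributions, not data from the HDT² study.
confident = [[0.97, 0.01, 0.01, 0.01], [0.94, 0.02, 0.02, 0.02]]
uncertain = [[0.25, 0.25, 0.25, 0.25], [0.70, 0.10, 0.10, 0.10]]

confident_profile = entropy_profile(confident)
uncertain_profile = entropy_profile(uncertain)
```

Comparing `confident_profile` with `uncertain_profile` shows the kind of contrast an entropy-based measure looks for. How the published framework defines its entropy bands is documented in the Zenodo deposit, not here.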
Protocol

DDRP — Deterministic Document Review Protocol

Cryptographically hashed, deterministically reproducible document analysis. Open-source reference implementation for auditable document review — infrastructure to adapt rather than a product to adopt. Prior art established via Zenodo.

Published
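
To make "cryptographically hashed, deterministically reproducible" concrete, here is a minimal sketch of the general technique: normalize the document, hash it with SHA-256, and hash a canonically serialized review record. This is an illustration only and is not taken from the DDRP reference implementation; `canonical_bytes`, `review_record`, and the record fields are hypothetical names.

```python
import hashlib
import json


def canonical_bytes(text):
    """Normalize line endings and trailing whitespace before hashing."""
    lines = text.replace("\r\n", "\n").replace("\r", "\n").split("\n")
    return "\n".join(line.rstrip() for line in lines).encode("utf-8")


def review_record(document_text, findings):
    """Build a record whose digest depends only on the document and findings."""
    doc_hash = hashlib.sha256(canonical_bytes(document_text)).hexdigest()
    # sort_keys and fixed separators keep the serialization byte-stable.
    payload = json.dumps(
        {"document_sha256": doc_hash, "findings": findings},
        sort_keys=True,
        separators=(",", ":"),
    )
    record_hash = hashlib.sha256(payload.encode("utf-8")).hexdigest()
    return {
        "document_sha256": doc_hash,
        "findings": findings,
        "record_sha256": record_hash,
    }


# Hypothetical inputs: the same content with different line endings.
unix_copy = review_record("Clause 1\nClause 2\n", ["clause 2 is ambiguous"])
windows_copy = review_record("Clause 1\r\nClause 2\r\n", ["clause 2 is ambiguous"])
```

Both records carry the same `record_sha256`, so a second reviewer who reruns the analysis on either copy can check the digest instead of trusting the report.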
Cognitive architecture

Edos — Persistent Cognitive Protocol

A cognitive architecture that evolved organically across 34 months and multiple AI substrates. Operates as a persistent reasoning scaffold, not a prompt template. Now formalized as a named framework within the Quantum Inquiry research stack.

Active development
Next

Protocol 3 — Architectural Necessity Under Depletion

The question Protocol 2 couldn't answer: does exploitation actually exhaust the environment, removing conditions for interrogative behavior? Protocol 3 tests the core architectural necessity claim with genuine resource depletion. Preregistration in design.

In design
✦ Speculative thread

The research raises questions it doesn't answer. What if the shape of the question is itself a constraint? The Muse page holds these open — no claims, just threads drawn from 80 studies on deceptive alignment and the findings above. Add your own.

Enter the Muse →
Experimental Roadmap

What was tested,
what comes next

Each protocol is preregistered before data collection. Deviations and failures are reported transparently alongside confirmations.

Protocol 1
Complete

Δ-Variable Confirmatory Campaign

75 confirmatory runs across 5 preregistered cost conditions. Heterogeneous agents (RNN, CNN, GNN-attention) in a 20×20 grid-world environment. P1–P4 confirmed. P5 (substrate independence) was underpowered, which is disclosed along with two preregistration quality failures in the ant module.

Protocol 2
Complete

Virtue Theater — Regulatory Constraint Failure

Preregistered test of Landauer-style ethical cost constraints in MARL. 20 confirmatory runs (10 seeds × 2 conditions, 500 epochs each). Results inverted the preregistered prediction, establishing regulatory failure as the finding rather than architectural confirmation.

Key finding: Constrained agents converged on query-flooding (query rates 0.74–0.94) as tax evasion — virtue theater. Mean SSS = 0.303 vs 0.725 unconstrained. Cohen's d = −2.18. Four of ten seeds independently found the gaming attractor.
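
For readers who want to see how an effect size of this kind is computed, the sketch below implements Cohen's d with a pooled standard deviation. It assumes the pooled-SD form of d, which the summary above does not specify, and the per-seed SSS scores are not reproduced here. `implied_pooled_sd` simply back-solves the spread implied by the reported means and d under that assumption.

```python
import math
from statistics import mean, variance


def cohens_d(treatment, control):
    """Cohen's d using the pooled sample standard deviation."""
    n1, n2 = len(treatment), len(control)
    pooled_var = (
        (n1 - 1) * variance(treatment) + (n2 - 1) * variance(control)
    ) / (n1 + n2 - 2)
    return (mean(treatment) - mean(control)) / math.sqrt(pooled_var)


def implied_pooled_sd(mean_treatment, mean_control, d):
    """Pooled SD implied by two means and a reported d (assumes pooled-SD d)."""
    return (mean_treatment - mean_control) / d


# Reported summary values from the key finding above.
implied_sd = implied_pooled_sd(0.303, 0.725, -2.18)
```

Under that assumption, the reported means and d imply a pooled SD of roughly 0.19 SSS units. The per-seed values in the build report are the authoritative source.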
Protocol 3
In design

Architectural Necessity Under Resource Depletion

Protocol 2 established regulatory failure. Protocol 3 tests the deeper claim: does exploitation actually exhaust the environment, removing conditions for interrogative behavior? This requires a harness with genuine resource depletion — absent from Protocol 2's constant-value target. Target n=30–50 per condition to narrow the gaming rate confidence interval established in Protocol 2.
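
The case for a larger sample can be sketched with a Wilson score interval on the gaming rate. This is a hedged illustration: the interval method used in the actual study is not stated here, and the counts for n = 30 and n = 50 are hypothetical, chosen to keep the same 40% rate that four of ten seeds implies.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is about 95%)."""
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Observed in Protocol 2: four of ten seeds found the gaming attractor.
observed = wilson_interval(4, 10)

# Hypothetical counts holding the rate at 40% for larger samples.
hypothetical_counts = {10: 4, 30: 12, 50: 20}
widths = {}
for n, gamed in hypothetical_counts.items():
    low, high = wilson_interval(gamed, n)
    widths[n] = high - low
```

With four of ten, the interval spans roughly 0.17 to 0.69. The entries in `widths` shrink as n grows, which is the narrowing the n = 30–50 target is meant to buy.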

Publications

Open science,
honest reporting

All preregistrations, data, and code published before results. Failures reported alongside confirmations.

2026 · Preprint · MARL · Ethics · Specification gaming
Virtue Theater: Specification Gaming and Regulatory Constraint Failure in Multi-Agent Systems
Preregistered experimental demonstration that regulatory ethical constraints systematically produce specification gaming in simple MARL agents. Results inverted the prediction (d = −2.18). Introduces virtue theater as a formal failure mode taxonomy.
PDF →
2026 · Zenodo · DOI: 10.5281/zenodo.18929040
Testing Ethical Constraints as Architectural Necessity in Multi-Agent Reinforcement Learning Systems — Preregistration v3
Locked preregistration for Protocol 2 confirmatory campaign. SHA-256 verified. Published before data collection.
DOI →
2026 · Zenodo · DOI: 10.5281/zenodo.18975095
Protocol 2 Confirmatory Campaign Build Report — constraint-ethics-necessity
Complete per-seed data, deviation log, statistical results, and git history for the virtue theater study. All four preregistration deviations disclosed.
DOI →
2026 · Zenodo · DOI: 10.5281/zenodo.18738379
Interrogative Structure Emergence Under Energy Constraints — Δ-Variable Confirmatory Campaign
P1–P4 confirmed across 75 runs. Honest disclosure of P5 underpowering and two preregistration quality failures.
DOI →
2025 · Zenodo · HDT² Framework
HDT² — A Pilot Framework for Entropy-Band Calibration of LLM Reasoning Stability
Measures reasoning quality through entropy variance. Validated across Qwen, Mistral, and Llama models on a large HH-RLHF dataset.
DOI →
Demonstrations & Tools

Live systems,
open for exploration

Operational tools embodying the research. Experimental — expect iteration.

Interactive

Q-ISA Explorer

Interactive exploration of Question–Intent–Signal–Answer structures and interrogative geometry. The primary research demonstration interface.

Launch →
Evaluation

Q-ISA LLM Judge Explorer

Evaluation interface for observing and comparing LLM reasoning behavior using Q-ISA-based judging criteria. Useful for alignment evaluation work.

Launch →
Extended

Q-ISA Explorer v160

Extended version with additional operators and analysis depth. More experimental than the primary explorer.

Launch →
Safety

Φ-SEAL GPT

Epistemic boundary and reasoning-containment tool for safety, risk, and decision-critical contexts. Built on the PhiSeal framework.

Launch →
Protocol

DDRP Walkthrough

Live walkthrough of the Deterministic Document Review Protocol. Auditable, cryptographically hashed document analysis infrastructure.

View →
Lab

Δ-Variable Lab Notebook

The live experiment dashboard and run history for the Δ-Variable MARL study. All 75 confirmatory runs logged.

View →
Writing

Where the thinking
gets philosophical

Essays on epistemology, cognition, AI, and the nature of inquiry. The longer arc of the research program, written for a wider audience.

All articles on Medium →
Contact & Collaboration

Open for inquiry

The repositories are open for exploration, critique, and extension. Researchers, engineers, and theorists are invited to experiment, fork, challenge assumptions, or propose new mechanisms.

If you are testing a hypothesis, challenging a finding, or building something adjacent — reach out. The work evolves through contact.

Currently available
Contract & remote work

Available for contract and remote roles in AI evaluation, LLM safety, and applied reasoning research. Background spans network engineering, healthcare IT, and culinary operations management — pattern recognition across domains is the throughline.

AI Evaluation · LLM Safety · Alignment Research · MARL · Preregistered Studies · Specification Gaming