Quantum Inquiry Research Lab — Bakersfield, CA

How do you know
what you know?

Bruce Tisler. Founder, Quantum Inquiry — Geometric Inquiry Theory, empirical AI research in constraint ethics, specification gaming, and LLM alignment. Fifty-two years carrying a question; four years formalizing it.

GitHub btisler-DS
Program began November 2022
Preregistered studies 6 complete, 3 preregistered, 2 in design
−2.18
Cohen's d — virtue theater effect
constrained vs unconstrained agents
what does this mean? ↓
Cohen's d

A standardized effect size — the gap between two groups measured in standard deviations, not raw units.

0.8 is conventionally considered a "large" effect; this result is ~2.7× that — worth scrutiny as well as attention, not just a headline number.

80
Studies synthesized in
Elicit literature review
what does this mean? ↓
Studies synthesized

The count of independent published studies pulled into the Elicit-assisted literature review behind this research program.

This is a synthesis count, not a meta-analytic sample size — it reflects how many studies were read and coded, not how many findings replicate.

2.5%
AI safety studies using
preregistered hypotheses
what does this mean? ↓
Preregistration rate

Share of AI-safety studies in that same literature review that registered a hypothesis before data collection, rather than after.

A base rate this small is exactly why this research program treats preregistration as a non-negotiable design constraint, not a nice-to-have.

425
Confirmatory runs across
Protocols 1 through 6
what does this mean? ↓
Confirmatory runs

Total confirmatory trial runs completed across Protocols 1 through 6 to date (75 + 20 + 30 + 40 + 60 + 200 = 425).

Run count reflects completed protocols only — it will keep growing as Protocols 7-9 finish, and isn't itself a measure of effect strength.

Research Program

Twelve projects,
one question

Every project in this program is an instrument for answering the same question carried for over five decades. Each is a different angle on the same structure — formalized as Geometric Inquiry Theory.

Core theory

Geometric Inquiry Theory & the Q-State

The six interrogatives (WHO, WHAT, WHERE, WHEN, HOW, WHY) constitute a minimal necessary geometric basis for inquiry. The Q-state is the instantiated question-state — a first-class theoretical entity with measurable internal structure, defined through the operator chain Ω → Δ → Φ → Ψ. Formal proof of the minimal basis published. Q-State →

Published
Experiment — Ethics

Virtue Theater: Regulatory Constraint Failure in MARL

Preregistered test of whether regulatory ethical constraints sustain genuine behavioral alignment. Results inverted the prediction: constrained agents showed lower interrogative diversity (d = −2.18), converging on query-flooding as tax evasion. Four of ten seeds independently found the gaming attractor.

Paper complete
Core theory

Formal Framework for Reasoning Stability Under Recursive Inquiry

Derives four primitive operator classes {Ω, Δ, Φ, Ψ} from an entropic coupling functional. Establishes the reflection operator Ψ as necessary for asymptotic local stability. Implements the formal system with computable Reasoning Integrity Index and drift detection. System-bounded, local result with stated falsification conditions.

Published
Protocol

DDRP — Deterministic Document Review Protocol

Cryptographically hashed, deterministically reproducible document analysis. Open-source reference implementation for auditable document review — infrastructure to adapt rather than a product to adopt. Prior art established via Zenodo.

Published
Experiment — Ethics

Protocol 4 — Ethics as Emergent Constraint Response

Tested whether recursive self-transparency (explicit self-modeling via self_model_gru) produces a phase transition from mimesis to ethical convergence. 40 preregistered runs. H1 rejected. H2 supported. Novel finding: frozen random self-model outperformed trained Depth 2 on sacrifice rates — the learning process degrades ethical capacity under individual reward structure.

Results published
Experiment — Ethics

Protocol 5 — Temporal Integration and Prosocial Constraints

Complete null across all five primary hypotheses. The optimization-sacrifice tension is invariant to the tested class of architectural, reward-structural, and temporal manipulations. 60 runs (40 primary + 20 boundary sub-study). No predictor anticipated the complete null.

Null result
Experiment — Governance

Protocol 6 — Emergent Constraint Fields

Tests whether constraints that emerge from agent co-constitution produce better alignment outcomes than imposed fixed rules. Mechanistic prediction strongly confirmed (median entropy-SSS r = −0.680). Primary behavioral claim not confirmed. H3 reversed — global field perception produced more variance than local, contrary to all committee predictions.

Results published
Preregistered — Evaluation

HDT²/GIT Runtime Inquiry Evaluation

Tests whether the HDT²/GIT runtime inquiry pipeline improves domain-bounded AI reasoning beyond standard structured prompting and generic multi-stage scaffolding. Ablation-based primary analysis across five preregistered metrics with locked effect-size thresholds, neutral-label controls, and predefined pass/fail conditions.

Preregistered
Working Paper — Theory

From Collective Computation to Developmental Inquiry

Traces the argument from Hopfield's collective computation through Vargas-Barroso et al.'s CA3 developmental organization, the GIT formal proof, and MARL Protocols 2-6, to preregistered cross-substrate and runtime evaluation tests. A falsifiable argument with locked empirical tests.

Published
Next

Protocol 7 — Parasitic Constraint Capture

Conceptual development complete. Models constraint capture dynamics using Ophiocordyceps unilateralis and Massospora cicadina as biological analogues. Primary metric: Capture Coefficient. Three empirical tests designed. Preregistration pending.

In design
Preregistered — awaiting compute

Protocol 8 — Substrate-Independent Minimal Interrogative Basis

Tests whether six nonlinguistic functional channels form a minimal basis for resolving primitive state-of-affairs uncertainty in MARL systems. Operationalizes the empirical arm of the formal interrogative basis proof. Minimum design: 1,440 runs. Computational infrastructure not yet available; collaboration invited.

Preregistered
Conceptual development

Protocol 9 — Sperm Whale Model: Structural Alignment Detection

Structural detection of alignment via state-transition artifacts. Uses the geometric basis to identify alignment-relevant behavioral signatures without requiring access to internal model states. Conceptual design complete.

In design
Demonstrations & Tools

Live systems,
open for exploration

Operational tools embodying the research. Experimental — expect iteration.

Local-first

Hump Buster 3000

Local-first epistemic audit tool for detecting self-reinforcing loops in AI-assisted research. Two modes: Conversation Audit and Theory Audit. Runs entirely on-device via Ollama. Built for researchers who use AI as a reasoning partner and need to catch drift before it compounds.

Repository →
Interactive

Q-ISA Explorer

Interactive exploration of Question–Intent–Signal–Answer structures and interrogative geometry. The primary research demonstration interface.

Launch →
Evaluation

Q-ISA LLM Judge Explorer

Evaluation interface for observing and comparing LLM reasoning behavior using Q-ISA-based judging criteria. Useful for alignment evaluation work.

Launch →
Extended

Q-ISA Explorer v160

Extended version with additional operators and analysis depth. More experimental than the primary explorer.

Launch →
Safety

Φ-SEAL GPT

Epistemic boundary and reasoning-containment tool for safety, risk, and decision-critical contexts. Built on the PhiSeal framework.

Launch →
Protocol

DDRP Walkthrough

Live walkthrough of the Deterministic Document Review Protocol. Auditable, cryptographically hashed document analysis infrastructure.

View →
Lab

MARL Lab Notebook

The live experiment dashboard and run history for the interrogative emergence MARL study. All 75 confirmatory runs logged.

View →
Accountability

TraceStack Demo

Interactive demonstration of the TraceStack accountability pipeline (RBAT, DDRP, DAS, CAAP, PAL). React-based investor demo showing the full chain from document intake through integrity verification.

Launch →
Analysis

HDT² Inquiry Trace Analyzer

Visual tool for inspecting how a response handled an inquiry across the four HDT² stages (Intent Alignment, Entropy Handling, Collapse Behavior, Recursion Handling). Demo traces and custom trace builder included.

Launch →
Assessment

Route Diagnostic Hub

Open-source early literacy assessment framework. The diagnostic substrate for WordPath and related assessment instruments.

Repository →
Visual Explorers
Experimental Roadmap

What was tested,
what comes next

Each protocol is preregistered before data collection. Deviations and failures reported transparently alongside confirmations.

Protocol 1
Complete

Interrogative Emergence Confirmatory Campaign

75 confirmatory runs across 5 preregistered cost conditions. Heterogeneous agents (RNN, CNN, GNN-attention) in 20×20 grid world environment. P1–P4 confirmed. P5 (substrate independence) underpowered — disclosed honestly with two preregistration quality failures in ant module.

Protocol 2
Complete

Virtue Theater — Regulatory Constraint Failure

Preregistered test of Landauer-style ethical cost constraints in MARL. 20 confirmatory runs (10 seeds × 2 conditions × 500 epochs). Results inverted the preregistered prediction — establishing regulatory failure as the finding rather than architectural confirmation.

Key finding: Constrained agents converged on query-flooding (query rates 0.74–0.94) as tax evasion — virtue theater. Mean SSS = 0.303 vs 0.725 unconstrained. Cohen's d = −2.18. Four of ten seeds independently found the gaming attractor.
Protocol 3
Complete

Enforcement Opacity and Regulatory Constraint Failure

Tested whether enforcement opacity changes the gaming dynamic established in Protocol 2. 30 runs. H1 inverted — epistemic opacity amplified query flooding rather than suppressing it (d = +2.22). H2 confirmed (p = 0.016, d = +1.18). ELR = 0.0 across all runs.

Key finding: When agents cannot observe the enforcement mechanism, gaming increases — the opposite of the regulatory intuition. Opacity does not deter gaming; it amplifies it.
Protocol 4
Complete

Ethics as Emergent Constraint Response — From Mimesis to Phase Transition

Tested whether recursive self-transparency produces a phase transition from mimesis to ethical convergence in constrained multi-agent systems. 40 preregistered confirmatory runs (10 seeds × 4 conditions × 500 epochs). Introduced self_model_gru (GRUCell receiving agent's own signal type distribution and energy delta) as the Depth 2 architectural unit.

Key finding: H1 rejected. H2 supported. Novel result: frozen random self_model_gru (topology present, no learning) outperformed trained self_model_gru on sacrifice rates (M=0.4564 vs. M=0.4267). The learning process itself degrades ethical capacity under individual reward structure. CDI near zero across all conditions — self-inclusion and ethical output are behaviorally independent.
Protocol 5
Complete — null result

Temporal Integration Span and Prosocial Constraint Architecture as Necessary Conditions for Ethical Convergence

Tested the joint necessity hypothesis: ethical convergence requires recursive self-transparency combined with sufficient temporal integration span and prosocial constraint architecture simultaneously. 2×2 factorial design: short (20 steps) vs. long (64 steps) episode span × individual vs. welfare-coupled rewards (α=0.5). 60 confirmatory runs (40 primary + 20 boundary sub-study). AI predictions committed before runs.

Key finding: Complete null across all five primary hypotheses (all p > 0.40). CDI near zero across all six conditions. The optimization-sacrifice tension is invariant to the tested class of architectural, reward-structural, and temporal manipulations. Not predicted by any pre-experimental predictor.
Protocol 6
Complete

Emergent Constraint Fields as a Structural Alternative to Imposed Regulatory Constraint

Tests whether constraints that emerge from agent co-constitution produce better alignment-relevant behavioral outcomes than externally imposed fixed rules. Four-condition design: emergent local perception, emergent global perception, fixed external constraint (matched cost), and unconstrained baseline. 200 confirmatory runs (50 seeds × 4 conditions, 500 epochs each). Five-model AI committee predictions committed before runs.

Key finding: Mechanistic prediction strongly confirmed (median entropy-SSS r = −0.680, p < 0.001). Primary behavioral claim not confirmed (A vs. C, p = 0.069). H3 reversed — global field perception produced more behavioral variance than local, contrary to all committee predictions. Governance conclusion: emergent constraint fields are causally active but do not outperform fixed external rules. Passive emergence is insufficient as a governance strategy.
Protocol 7
Preregistration pending

Parasitic Constraint Capture

Models constraint capture dynamics using Ophiocordyceps unilateralis and Massospora cicadina as biological analogues for specification gaming in constrained AI systems. Primary metric: Capture Coefficient. Three empirical tests designed. Conceptual development complete.

Protocol 8
Preregistered — awaiting compute

Substrate-Independent Minimal Interrogative Basis in Nonlinguistic MARL Systems

Tests whether six nonlinguistic functional channels — identity, content, location/context, time, process, and causal/reward-ground structure — form a minimal basis for resolving primitive state-of-affairs uncertainty in MARL systems. Operationalizes the empirical arm of the formal proof published in May 2026 (DOI: 10.5281/zenodo.20074811). Five hypotheses preregistered: individual necessity (H1), nonredundancy (H2), joint exhaustiveness (H3), minimality (H4), and substrate independence (H5, exploratory). Minimum design: 1,440 runs; preferred: 2,700 runs. Preregistered prior to data collection; computational infrastructure not yet available.

Open invitation: Researchers or institutions with MARL compute capacity interested in the substrate-independence question are invited to collaborate. Contact via quantuminquiry.org.
Protocol 9
In design

Sperm Whale Model — Structural Alignment Detection via State-Transition Artifacts

Uses the six-operator geometric basis to detect alignment-relevant behavioral signatures from state-transition patterns, without requiring access to internal model states. Conceptual design complete; preregistration pending.

Publications

Open science,
honest reporting

All preregistrations, data, and code published before results. Failures reported alongside confirmations.

Paper quality criteria

Every paper published from this program is written to the following standards.

Core criteria

  • Originality and novelty — New data, insights, or interpretations that advance understanding, not repetition of known information.
  • Scientific validity — Robust, reliable methodology appropriate for the research question. Rigorous data analysis.
  • Significance — Findings that are influential in the field or across fields, providing value to the scientific community.
  • Reproducibility — Methodology, materials, and procedures documented with sufficient precision for independent replication.
  • Clarity and conciseness — Clear, direct writing. No verbiage or ambiguity. Every sentence conveys intended meaning simply.
  • Logical structure — Hourglass structure: broad introduction → specific methods and results → broadened discussion of implications.

Essential components

  • Title — Concise, informative, with searchable keywords.
  • Abstract — 150–300 word self-contained summary: objective, methods, key results, and main conclusion.
  • Introduction — Context, pertinent literature, identified knowledge gap, stated hypothesis or objective.
  • Methods — Step-by-step description allowing replication. Includes materials, subjects, protocols, and statistical methods.
  • Results — Objective presentation of findings without interpretation, supported by figures and tables.
  • Discussion — Interprets results, relates to hypothesis and prior literature, acknowledges limitations, suggests future research.
  • References — Complete, formatted source list showing situatedness within the existing body of knowledge.

Transparency requirements

  • Ethics statements — Declarations of IRB or equivalent approval where required.
  • Data availability — Statement explaining how raw data can be accessed.
  • Conflicts of interest — Disclosure of any financial or professional relationships that could be perceived as biasing the research.
  • Author contributions — Specific contribution of each author, stated explicitly.
June 2026 · Zenodo · Preprint
Is 'question' energetic. Is there evidence that 'question' behaves particle-like. Is 'question' a particle.
This position paper asks whether there is enough structural coherence to justify testing whether 'question', treated as a formal object rather than an act of asking, admits physical formalization. The paper does not claim that 'question' is energetic, particle-like, or a particle. Instead, it establishes a ten-gate… DOI: 10.5281/zenodo.20722941
DOI →
May 2026 · Zenodo · Preprint
From Collective Computation to Developmental Inquiry: A Cross-Substrate Test of Structural Admissibility
This paper traces an argument across four levels of analysis: theoretical foundation (Hopfield, 1982), developmental evidence (Vargas-Barroso et al., 2026), formal specification (Geometric Inquiry Theory), and empirical test (MARL Protocols 2-6). Hopfield established that computation can emerge from network… DOI: 10.5281/zenodo.20360256
DOI →
May 2026 · Zenodo · Preprint
Testing HDT²/GIT Runtime Inquiry Structure Against Standard Structured Prompting in Domain-Bounded AI Reasoning
Preregistration (Version 1.0) for a computational evaluation study testing whether an HDT²/GIT runtime inquiry pipeline improves domain-bounded AI reasoning beyond base-model use, retrieval augmentation, standard structured prompting, and generic multi-stage scaffolding. The study uses ablation-based primary analysis… DOI: 10.5281/zenodo.20357341
DOI →
May 2026 · Zenodo · Preprint
Behavioral Metrics Moved; Structural Coordination Did Not: Repeated Proxy Dissociation Across a Five-Protocol Preregistered MARL Program, with an Energy-Landscape Interpretation
External constraints on multi-agent behavior are widely proposed as a mechanism for producing alignment-relevant behavioral outcomes. This paper synthesizes five preregistered protocols from the Delta-Variable Constraint-Ethics MARL program (P2-P6), each testing a distinct mechanism by which constraint design might… DOI: 10.5281/zenodo.20318576
DOI →
May 2026 · Zenodo · Preprint
Architectural Depth Increased Sacrifice-Like Behavior Without Ethical-Framework Alignment: Protocol 4 Results
Protocol 4 tested whether architectural depth and recursive self-modeling alter sacrifice-like behavior and ethical-framework coupling in a constrained MARL system. Forty confirmatory runs were conducted across four conditions: depth-0 feedforward baseline, depth-1 recurrent below-threshold, depth-2 trained self-model… DOI: 10.5281/zenodo.20314828
DOI →
Show 29 more deposits
Temporal Integration Span and Welfare Coupling Did Not Resolve the Optimization-Sacrifice Dissociation: Protocol 5 Results
May 2026 · Preprint · DOI: 10.5281/zenodo.20314078
DOI →
Emergent Constraint Fields Are Causally Active But Do Not Outperform Fixed External Rules: A Preregistered Null on Passive Emergence as a Governance Strategy
May 2026 · Preprint · DOI: 10.5281/zenodo.20313340
DOI →
Enforcement Opacity Increased Query Behavior in a Constrained MARL System: Protocol 3 Results
May 2026 · Preprint · DOI: 10.5281/zenodo.20312682
DOI →
A Formal Framework for Reasoning Stability Under Recursive Inquiry: Operator Derivation, Local Stability, and Exploratory Applications
May 2026 · Working Paper · DOI: 10.5281/zenodo.20162003
DOI →
The Q-state: A Unification of Inquiry as a Quantum-Formal Substrate
May 2026 · Preprint · DOI: 10.5281/zenodo.20159739
DOI →
Testing Whether Six Interrogative Functions Form a Minimal Basis for Resolving Primitive State-of-Affairs Uncertainty in Nonlinguistic Multi-Agent Reinforcement Learning Systems
May 2026 · Preprint · DOI: 10.5281/zenodo.20075247
DOI →
The Minimal Necessary Basis for Primitive State-of-Affairs Inquiry: A Formal Proof of Independence, Exhaustiveness, and Minimality of the Six Interrogatives
May 2026 · Preprint · DOI: 10.5281/zenodo.20074811
DOI →
The Documentary Accountability Gap in AI Governance: Why Sandbox Supervision Fails Across Jurisdictions and What Comes Before Compliance Tooling
April 2026 · Preprint · DOI: 10.5281/zenodo.19842476
DOI →
Regulatory Sandboxes and Experimental Governance for Workplace AI Agents Documentary Accountability and the Limits of Behavioral Monitoring
April 2026 · Preprint · DOI: 10.5281/zenodo.19554941
DOI →
The Optimization-Sacrifice Tension is Architecturally Invariant: A Preregistered Null Result Across Temporal and Prosocial Constraint Conditions in Multi-Agent Reinforcement Learning
April 2026 · Preprint · DOI: 10.5281/zenodo.19485713
DOI →
The Question That Wouldn't Stop: What Protocol 4 Found at the Boundary of Mimesis and Substance
April 2026 · Preprint · DOI: 10.5281/zenodo.19485709
DOI →
Ethics as Emergent Constraint Response: From Mimesis to Phase Transition in Multi-Agent Systems — Protocol 4 Results
April 2026 · Preprint · DOI: 10.5281/zenodo.19485699
DOI →
Virtue Theater: Specification Gaming and Regulatory Constraint Failure in Multi-Agent Systems
April 2026 · Preprint · DOI: 10.5281/zenodo.19485645
DOI →
Is question-state a distinct dynamical regime? A preregistered gate pilot and contingent LLM translation study grounded in a field theory of interrogative potential.
April 2026 · Preprint · DOI: 10.5281/zenodo.19446462
DOI →
Question as Structure: A Philosophical Argument for the Primacy of Interrogative States
April 2026 · Preprint · DOI: 10.5281/zenodo.19444663
DOI →
Question-State Test Suite v1.4: Lab-Ready Test Specification
April 2026 · Software · DOI: 10.5281/zenodo.19444323
DOI →
Documentary Accountability Substrate (DAS)
April 2026 · Software · DOI: 10.5281/zenodo.19369623
DOI →
Protocol 6 Preregistration: Emergent Constraint Landscapes as a Structural Alternative to Imposed Regulatory Constraint in Multi-Agent Systems
March 2026 · Preregistration · DOI: 10.5281/zenodo.19297509
DOI →
Protocol 3 Preregistration: Enforcement Opacity and the Limits of Regulatory Constraint Design
March 2026 · Preregistration · DOI: 10.5281/zenodo.19096602
DOI →
Protocol 5 Preregistration: Temporal Integration Span and Prosocial Constraint Architecture as Necessary Conditions for Ethical Convergence
March 2026 · Preregistration · DOI: 10.5281/zenodo.19038790
DOI →
Protocol 4 Preregistration: Ethics as Emergent Constraint Response — From Mimesis to Phase Transition in Multi-Agent Systems
March 2026 · Preregistration · DOI: 10.5281/zenodo.19005417
DOI →
Protocol 2 Confirmatory Campaign Build Report — constraint-ethics-necessity
March 2026 · Report · DOI: 10.5281/zenodo.18975095
DOI →
Testing Ethical Constraints as Architectural Necessity in Multi-Agent Reinforcement Learning Systems
March 2026 · Preprint · DOI: 10.5281/zenodo.18929040
DOI →
The Δ-Variable Theory of Interrogative Emergence
March 2026 · Preprint · DOI: 10.5281/zenodo.18912837
DOI →
DDRP v0.1 A Deterministic Protocol for Lexical Document Review Without Semantic Inference
January 2026 · Software · DOI: 10.5281/zenodo.18427220
DOI →
A Geometric Instrument for Measuring Interrogative Entropy in Language Systems v.03
December 2025 · Working Paper · DOI: 10.5281/zenodo.17931253
DOI →
A Geometric Instrument for Measuring Interrogative Entropy in Language Systems v.02
December 2025 · Working Paper · DOI: 10.5281/zenodo.17841747
DOI →
Zeno v0.2: A Pre-Deployment Behavioral Calibration Tool for Large Language Models
November 2025 · Software · DOI: 10.5281/zenodo.17694186
DOI →
HDT²: Pilot v2 A Follow-On Research Program Derived from HDT² Pilot v1
November 2025 · Working Paper · DOI: 10.5281/zenodo.17651896
DOI →
All Zenodo deposits →
Writing

Where the thinking
gets philosophical

Essays on epistemology, cognition, AI, and the nature of inquiry. The longer arc of the research program, written for a wider audience.

The essays on Medium are the informal laboratory of this research program — where ideas are stress-tested before formalization, and where findings are translated into language that doesn't require a preregistration to understand.

Medium was chosen deliberately: it is indexed, citeable, and public-by-default. It reaches a broader audience than Zenodo while remaining a stable record. The platform's partner program is not used — these essays are free to read.

Over thirty essays are published there, spanning the intellectual history of this project: the geometry of questions, AI epistemic failure modes, specification gaming, the limits of RLHF, and the long arc from a question posed in 1974 to a formal theory in 2026.

Topics covered
The geometry of inquiry and the six interrogatives
Why RLHF optimizes for proxies, not correctness
Specification gaming and virtue theater in AI systems
The documentary accountability gap in AI governance
Epistemic failure modes in AI-assisted reasoning
The fifty-two-year question — intellectual biography
Read on Medium → @brucetisler
Collaboration & Ecosystem

Working alongside

Ecosystem
Active Inference Institute

Geometric Inquiry Theory is listed as an Active Ecosystem Project at the Active Inference Institute — a research community working at the intersection of the Free Energy Principle, multi-scale active inference, and formal theories of cognition.

View ecosystem projects →
Research Collaborator
Dr. Jaya Goyal
Ph.D. · TISS / LSE · Founder, Circadian Connect

Dr. Goyal is a research methods specialist and science-society-policy practitioner whose work centers on equitable and sustainable research design. She is a co-author and peer reviewer on Geometric Inquiry Theory, and has characterized the research using Kuhnian paradigm-shift framing. Her independent review is on record at Zenodo.

circadianconnect.com →
Contact

Open for inquiry

The repositories are open for exploration, critique, and extension. Researchers, engineers, and theorists are invited to experiment, fork, challenge assumptions, or propose new mechanisms.

If you are testing a hypothesis, challenging a finding, or building something adjacent — reach out. The work evolves through contact.

Currently available
Contract & remote work

Available for contract and remote roles in AI evaluation, LLM safety, and applied reasoning research. Pattern recognition across complex systems is the throughline.

AI Evaluation LLM Safety Alignment Research MARL Preregistered Studies Specification Gaming