What is the ASI Evolution Engine?

The ASI Evolution Engine is Treeova's self-improving configuration system. It uses a four-agent pipeline — Researcher, Engineer, Analyzer, Judge — to generate, evaluate, and selectively adopt configuration changes for named platform domains under hermetic evaluation contracts and status-based mutex safety.

What does 'self-improving' mean here?

It means the engine proposes and evaluates changes to platform configuration — not application code, and not user data. A proposed change is only adopted when it passes a hermetic evaluation contract and clears a Judge agent's review. Nothing self-deploys without a structured pass through the pipeline.

What are the four agents in the pipeline?

The Researcher gathers prior cognition relevant to the target domain. The Engineer drafts a candidate configuration change. The Analyzer runs the candidate against the domain's evaluation contract. The Judge reviews both the candidate and the analysis and issues a structured adopt / reject / iterate verdict.

What is a hermetic evaluation contract?

A hermetic evaluation contract is a domain-specific specification of inputs, the candidate change, and the deterministic outputs that must be produced for the change to be considered evaluable. The contract isolates the evaluation from outside state, so the same candidate produces the same evaluation result across runs.

What is the status-based mutex?

The status-based mutex is the safety mechanism that prevents two evolution runs in the same domain from racing each other. A domain in an active evolution status cannot start another run in the same status; the engine enforces this at the data layer, not at the application layer.

What does this whitepaper deliberately withhold?

It withholds the Judge's scoring rubric and threshold values, the mutation operators used by the Engineer, the per-domain evaluation thresholds, the model routing used per agent role, and the exact prompts. These constitute the platform's competitive surface.

ASI Evolution Engine Whitepaper

WP-04 — ASI Evolution Engine: Self-Improving Configuration

The ASI Evolution Engine is Treeova's self-improving configuration system. A four-agent pipeline — Researcher, Engineer, Analyzer, Judge — generates and evaluates candidate configuration changes for named platform domains under hermetic evaluation contracts and a status-based mutex that prevents intra-domain races. Judge rubrics, mutation operators, eval thresholds, per-role model routing, and exact prompts are intentionally withheld.

Authored by Treeova Research· Research CollectiveUpdated 2026-04-18

#1. Overview

The ASI Evolution Engine is the system Treeova uses to evolve its own configuration. It does not rewrite application code, and it does not touch user data. Its job is to propose candidate changes to the named configuration domains that govern platform behavior, evaluate those candidates under hermetic contracts, and adopt only the candidates that demonstrably improve the relevant domain.

The design is deliberately conservative. Every adoption is the end of a fully recorded pipeline run: Researcher evidence, Engineer draft, Analyzer evaluation, Judge verdict. Nothing self-deploys.

See the product surface for this methodology: Feature page for Kronos.

#2. The Four-Agent Pipeline

Researcher. Pulls prior cognition relevant to the target domain — past evaluations, calibration trends, cross-domain learnings — from the engine's cognition store.
Engineer. Drafts a candidate configuration change shaped to the target domain's contract.
Analyzer. Runs the candidate through the domain's hermetic evaluation contract and produces a structured evaluation report.
Judge. Reviews candidate and evaluation together and returns one of three structured verdicts: adopt, reject, or iterate.

The four roles are decoupled by design. Each role's output is structured input to the next role, and each role's run is individually auditable.

#3. Evolution Domains

The engine operates over a fixed set of named domains, each with its own evaluation contract. Public framing covers nine domains spanning intelligence, risk, calibration, agent infrastructure, and content surfaces. Domains are named publicly so cross-domain cognition can be referenced; the contents of each contract are withheld.

#4. Hermetic Evaluation Contracts

A hermetic evaluation contract is a domain-specific specification that fixes the inputs, the candidate change format, and the deterministic outputs an Analyzer run must produce for the candidate to be considered evaluable.

The contract isolates the evaluation from outside state. The same candidate, re-evaluated tomorrow, produces the same evaluation result. This is what makes the Judge's verdict defensible — the verdict is a function of the candidate and the contract, not of ambient runtime conditions.

#5. Cross-Domain Cognition Store

Each completed run contributes to a shared cognition store. The Researcher consults this store on the next run, so improvements in one domain can inform investigations in adjacent domains. Cognition entries are scoped, dated, and revisable — not a monolithic prompt blob.

#6. Status-Based Mutex Safety

To prevent two evolution runs from racing inside the same domain — for example, two Engineers concurrently drafting candidates against the same baseline — the engine enforces a status-based mutex at the data layer. A domain whose status indicates an active evolution cannot start another run in the same status.

Enforcing the mutex at the data layer rather than in application code means the safety property survives application restarts, parallel workers, and retry loops. The mutex is a property of the system, not of any one process.

#7. The Judge Role

The Judge is the only role permitted to return an adopt verdict. Its inputs are the candidate, the Analyzer's evaluation report, and the relevant cognition. Its output is a structured verdict with rationale. The Judge's scoring rubric and threshold values are intentionally withheld; the Judge's role and contract are public.

#8. What This Whitepaper Withholds

The Judge's scoring rubric and threshold values.
The mutation operators the Engineer is allowed to apply.
Per-domain evaluation thresholds and pass/fail bands.
The model routing used for each role.
The exact prompts used inside any role.

#9. Limitations

The engine evolves configuration, not application logic or user data. Configuration changes are powerful but bounded.
Hermetic contracts make evaluations reproducible, but they cannot guarantee that the contract itself captures every relevant production effect. Contracts are reviewed and revised; no contract is permanently complete.
The status-based mutex prevents intra-domain races, but cross-domain dependencies still require human judgment when two domains are co-evolved.
Candidate generation is bounded by the Engineer's mutation space. Genuinely novel configurations outside that space require human authoring, not autonomous evolution.
Nothing the engine does is investment advice. Evolution improves platform behavior; trading outcomes depend on markets, not on configuration alone.