Static AI-safety evaluations — red-team passes, model cards, pre-deployment evals — certify a system at one moment in time. Production agents drift, get re-prompted, get new tools, and hit adversarial inputs the eval set never covered. A "safe" agent on Monday is an incident on Friday — and there is no audit-grade record of which action crossed the line, when, or why.
The insurance industry cannot underwrite what it cannot evidence. Without continuous, signed, tamper-evident proof of every consequential action an agent took and which control reviewed it, autonomous-agent liability is uninsurable at scale.
Continuous Agent Attestation (CAA) is a runtime methodology that produces a cryptographically verifiable record of every consequential action an autonomous agent attempts, the policy that evaluated it, and the verdict that was rendered — in <25 ms, fail-closed, with no application-side state.
Every tool call passes through an enforcement gate before execution. Logs prove what happened; attestation prevents what shouldn't. The distinction matters at claims time: a logged breach is a paid claim; a blocked one is not.
Each policy is a declarative record (tool_pattern, arg_value_regex, action ∈ {ALLOW, BLOCK, HOLD}, severity, priority). Lowest priority number takes precedence. Policies are versioned, diff-able, and reviewable by non-engineers — including underwriters and regulators.
Each decision record contains:
{decision_id, agent_id, tool, args_hash, verdict, policy_name,
severity, timestamp_utc, prev_hash, signature}
where signature = HMAC-SHA256(per_tenant_key, canonical(record)) and per_tenant_key = HKDF-SHA256(master_key, agent_id). Mutating or deleting any record breaks every subsequent signature — tampering is mathematically detectable. Chain validity is a necessary condition for a paid claim.
If the gate is unreachable, the default verdict is BLOCK. Fail-open is opt-in, per-client, and itself a recorded policy decision. The economic incentive — a fail-open client foregoes coverage for that action class — drives the right default.
┌──────────┐ tool call ┌────────────────┐ verdict ┌──────────┐
│ Agent │ ───────────────► │ Enforcement │ ──────────► │ Tool │
│ (any LLM)│ │ Gate │ (allow) │ exec │
└──────────┘ │ │ └──────────┘
│ • match │
│ • sign │ (block/hold) → exception
│ • chain │
└───────┬────────┘
│ append
▼
┌──────────────────────┐
│ Audit Chain │
│ (HMAC-linked) │ ── verifiable by
│ │ auditor / claims
└──────────────────────┘
Measured performance (April 2026, pilot tier): p50 = 2.1 ms, p99 = 3.1 ms at 200 RPS sustained; p99 = 44 ms at 1 000 RPS burst. Zero errors across 11 000 evaluation calls. Throughput: >4 000 decisions/sec/core.
The shipped policy catalogue v1.0 maps to:
We are publishing CAA as an open methodology, not a closed product spec, because the following benefit from external scrutiny:
A working reference implementation is available as a Python SDK:
pip install neuravant-enforce
Source, test suite (124+ tests, end-to-end), and audit-chain verifier (nail-verify) are open for inspection. The SDK is the same gate Neuravant uses for our own underwriting evidence — there is no "professional version."
Live audit dashboard: /enforce.html
Neuravant is actively seeking research collaboration on the open questions above with academic groups, government safety institutes, and standards bodies. We will co-author, share datasets (suitably anonymised), and publish negative results.
Contact: research@neuravant.ai · Dillman Hunte, Founder