Aria Harness
Runtime for autonomous AI workforces

Stop renting intelligence. Build an AI workforce that remembers, researches, coordinates, and executes under governance.

Aria Harness is not another agent shell and it is not a prompt costume for the same fragile model behavior. It is a runtime that raises cognition quality itself: memory continuity, Aegis anti-hallucination control, Hive coordination, Lobster research, authority tiers, verified proof telemetry, and persistent execution across Claude Code, Codex, aria-cli, and harness-linked workers. This is the layer that turns raw inference into durable operational intelligence.

75 agents / 15 departmentsCognition amplificationAegis anti-hallucination layerLobster research laneCross-session drift preventionAuthority-tiered autonomy
75 / 15source-backed architecture: 75 agents across 15 departments
Verifiedverified proof snapshots from the real telemetry service, not toy demos
Aegisanti-hallucination gates before edits, commands, and emission
Persistentmemory substrate, session locks, receipts, queue state, and recovery
01
Cognitive Runtime KernelLocal encrypted runtime state, receipts, memory substrate, telemetry, leases, and cognition scaffolding.
02
Aegis Governance LayerInterception and validation gates that suppress hallucinated edits, false certainty, and unsafe action.
03
Hive CoordinationAgents heartbeat, lock, message, lease work, and stay synchronized as one operational field instead of isolated chats.
04
Workforce + Lobster ResearchAuthority-tiered agents plus deep research and outside-world evidence synthesis routed back into the runtime.
Lobster runs the research lane: external recon, source triangulation, competitor sweeps, and evidence packets.OpenClaw workers bind through CONSULT -> BIND -> ACT instead of improvising blind execution.The model does inference. The runtime raises cognition quality, continuity, safety, and operational coherence.
Runtime Proof

Visible evidence that the runtime is raising quality across real coding surfaces

This is the public proof layer for Aria Harness: verified runtime quality signals, actor compliance, source coverage, and platform activity across Codex, Claude Code, Aria CLI, and related harness execution surfaces.

Verifying proof payload

Proof Metrics

Loading runtime proof metrics...
Awaiting first payload
Loadingwaiting for telemetry

Platform Mix

waitingactor breakout pending

Observed Sources

Loadingsource rows pending

Guard Pressure

Loadingfailure rows pending

Integration Gaps

Loadinggap rows pending
Category Thesis

The AI industry does not just have a model problem. It has a runtime problem.

Better prompts do not fix amnesia. Bigger context windows do not fix coordination. More models do not fix governance. Most AI products still collapse under long-horizon work because they have no durable cognition substrate, no authority model, no research lane, no anti-hallucination enforcement, and no operational memory. Aria Harness exists for the layer that actually decides whether an agent becomes dependable or delusional.

Industry failure

Agents sound smart, then lose the plot.

Session amnesia, weak retrieval, ungoverned tool use, fake confidence, and cross-session collisions make many agent products look impressive in demos and unreliable in real work.

Harness answer

Runtime-quality intelligence instead of prompt-quality theater.

Aria Harness adds continuity, proof, governance, research, recovery, authority tiers, and live workforce coordination so quality compounds instead of decaying over time.

Why labs care

It makes stronger models materially more useful

If frontier labs want agents to survive real enterprise deployment, they need a runtime that reduces hallucination, holds continuity, governs autonomy, and surfaces evidence. Harness is that missing layer.

Why builders care

It solves what breaks shipping teams first

Fewer collisions, better evidence, live queue state, safer autonomy, stronger research, clearer receipts, and much higher confidence that the next action still matches the mission.

What the runtime does

Infrastructure for cognition, governance, research, and workforce execution

Receipts

Receipt chains that preserve causality, lineage, and reasoning integrity

Every pre, mid, and post cognition phase can mint receipts. Later actions inherit validated prior context instead of improvising from decayed state or bluffing over missing reasoning.

Memory

Flexible memory substrate that cures context loss

The runtime preserves, retrieves, and reinjects what matters so agents stop forgetting decisions, dropping continuity, flattening nuance, or restarting large projects from scratch.

Aegis governance

Anti-hallucination gates before edits, commands, emission, and autonomous action

Pre-action cognition, Aegis interception, and post-output validation turn execution into a governed surface rather than a raw model impulse.

Lobster research

Outside-world recon, source triangulation, and evidence packets

Lobster is the research lane for the workforce: external scans, competitive mapping, policy and standards sweeps, source comparison, and briefings that come back into the runtime as evidence instead of vibes.

AI Workforce

A real autonomous workforce, not one general model pretending to be a company

The source-backed architecture today is a 75-agent, 15-department AI workforce, with broader HQ surfaces exposing well over 90 named roles. Each role is coded with its own function, authority posture, and runtime placement. This is the opposite of a single-agent wrapper with brand voice.

75 agents15 departments across executive, acquisition, finance, compliance, operations, intelligence, SEO, and more.

Harness treats the workforce as a system: not just chat personas, but coordinated roles with hierarchy, runtime type, and authority posture.

Lobsterresearch lane for external intelligence, verification, and evidence synthesis.

Lobster equips the workforce with outside-world awareness so planning, sales, operations, and engineering can act on evidence instead of speculation.

90+ rolesbroader named roster surfaces already exceed ninety public-safe roles.

The visible roster stretches from GHOST and HAWK to TrendBot, REISiteBot, Architect, Locksmith, Oracle, Overwatch, and Watchdog.

Executive + councils

Aria and the command layer

Aria, Coding Council, Strategy Council, Agentic Army, Voice Agents, Evolution Stack, Auto Rules Engine.

AriaCoding CouncilStrategy CouncilAgentic ArmyVoice AgentsEvolution StackAuto Rules Engine
SPECOPS core

Market, deal, and intelligence operators

Underwriting, buyer acquisition, seller acquisition, routing, verification, outreach, voice, research, and monitoring.

GHOSTHAWKVIPERATLASSENTINELECHOSIRENPHANTOMSTORYTELLERPATHFINDERORACLE-INTELWATCHDOGQUARTERMASTERPROPHET
Functional leads

Finance, compliance, sales, ops, and technology

Workforce roles covering treasury, legal, brand, routing, infrastructure, resource management, and execution discipline.

TREASURERLEDGERTAXMANAUDITORCOUNSELORMARSHALSHIELDCLOSERDIPLOMATCATALYSTAMPLIFIERGUARDIANSPIDERFORGECONDUCTORINSPECTORARCHITECTLIBRARIANLOCKSMITHMECHANICNEGOTIATORDISPATCHERSCOUTWARDENBEACONOVERWATCH
Analytics + search

Pattern, ranking, and forecasting specialists

Prediction, decoding, keyword discovery, content shaping, local search, and long-horizon competitive analysis.

SEERPROPHETVISIONCIPHERMATRIXINDEXERLINKFORGEKEYWORDMINERCONTENTFORGEGMBMANAGERCITATIONBOTREVIEWBOTLOCALRANKMAPPACKSERPTRACKERCOMPETITORSPYLONGTAILBOTTRENDBOTINTENTBOTGAPANALYZER
Field + site specialists

Acquisition, transactions, link building, and site-specific operators

Workers specialized for property hunting, transaction flow, backlink campaigns, and per-site execution.

REAPERLEGACYFALCONNEXUSOVERSEERAMBASSADORAEGISHERALDORACLEOUTREACHBOTGUESTPOSTBROKENLINKAUTHORITYBOTDISAVOWBOTREISITEBOTDISPOSITEBOTINVESTSITEBOTLANDINGSITEBOTPORTFOLIOBOT
Autonomy Levels

Autonomy is tiered, supervised, and legible instead of hidden behind one magic checkbox

Harness exposes two separate control planes: hierarchy tiers for who the agent is inside the workforce, and authority modes for how much freedom it has to act. That separation is what makes autonomy usable in real operations.

Tier 0-3Aria, Directors, Managers, Workers.

Tier 0 is Aria. Tier 1 is directors. Tier 2 is managers. Tier 3 is workers. The runtime knows the difference between strategy, delegation, and execution instead of flattening everyone into one blob.

3 modesHIGH_AUTONOMY, SUPERVISED, APPROVAL_REQUIRED.

Some agents can execute freely in their lane. Some are reviewed by Aria. Some escalate for human approval. This is how you get real agency without surrendering governance.

5 typesautonomous, chat-only, scheduled, event-driven, hybrid.

Not every agent should live the same life cycle. Harness gives each role the runtime shape that actually matches its job.

Hive Runtime

Coordination becomes part of the reasoning substrate, not a side dashboard no one checks

Hive Runtime turns isolated sessions into a coordinated field. Agents can register, heartbeat, lock files, send targeted or broadcast session messages, and receive unread coordination inside the next harness packet itself, where it can shape live reasoning instead of sitting ignored in a dashboard.

Cross-session communication should not live in a dashboard no one checks. It should arrive inside the next reasoning substrate so the agent sees it while thinking.

HIVE_SESSION_INBOX

OpenClaw workers stop being loose prompt-driven subprocesses and become governed operators with CONSULT -> BIND -> ACT.

OPENCLAW HARNESS BINDING

Locks

Prevent silent collisions

File and session locking means parallel workers do not stomp the same code path and call it collaboration after the damage is done.

Messaging

Broadcast and targeted coordination

Sessions can ask for handoff, signal blockers, announce intent, or resolve lock conflicts across the whole harness field.

48-hour technical preview

Try the full workforce runtime, not a toy-tier sandbox

The preview lasts 48 hours from the first successful heartbeat. No credit card. Bring your own model key. You get the real runtime: Aegis anti-hallucination, Hive coordination, memory continuity, verified proof telemetry, Lobster research lane, session locks, cross-session inbox, and workforce autonomy primitives as they actually exist. This is the operating surface that raises output quality in practice, not a stripped-down marketing tier.

Clock starts on first heartbeatNo credit cardBYO model keyAegis anti-hallucination activeLobster research lane included75-agent architectureRead-only after previewBuilt for real long-running work
What it solves

The AI industry problems this runtime actually attacks

This is not a decorative wrapper around frontier models. It is infrastructure for the failures that keep stopping AI systems from becoming trusted operational systems.

Hallucination

Stops agents from sounding confident while being wrong

  • Aegis gates before action and emission
  • Receipts and lineage instead of improvised certainty
  • Proof telemetry for blocked, intercepted, and passing turns
Research

Stops planning from happening in an evidence vacuum

  • Lobster external recon and source triangulation
  • Competitive and domain sweeps for decision support
  • Evidence packets returned into the runtime, not pasted from the side
Continuity

Stops every new session from becoming a partial lobotomy

  • Memory substrate and continuity receipts
  • Session-aware runtime state
  • Long-horizon task survival beyond one chat window
Authority

Stops autonomy from being either fake or terrifying

  • Tiered hierarchy for workforce roles
  • HIGH_AUTONOMY, SUPERVISED, APPROVAL_REQUIRED
  • Governed execution instead of hidden free-for-all behavior
Developer surfaces

Turns coding platforms into governed workspaces

  • Codex, Claude Code, aria-cli, and harness-linked workers
  • Proof lines, metrics, and runtime receipts
  • A single runtime fabric across different coding surfaces

The point is not that Aria Harness makes AI prettier. The point is that it makes AI more governable, more coherent, harder to delude, and much more capable of acting like a real workforce.

Runtime Proof
Loading verified harness runtime, SDK, and coding-surface metrics...
waitingactor mix pending
Expected surfaces: Codex, Claude Code, OpenCode, aria-cli, and harness-linked workers.
Loadingruntime proof payload