How does AgentOS track cost?

Every task carries a cost budget as part of its execution contract. Spend is tracked per task at sub-penny precision and rolled up to the unit of work — the work order, the phase, the role, the model. Cost is a property of the task, not a monthly surprise from a vendor.

Why does it get cheaper over time?

Three loops: frontier spend trends down as memory absorbs explanations and local models handle a growing share of routine work; review collapses to minutes because work arrives with its own evidence packet; and the institutional record compounds, so tomorrow's agents inherit today's lessons without retraining.

What is model routing?

AgentOS routes tasks to models by task class and cost class, not by whoever shouted last. Cheap, fast local models handle bulk grounding and class-bounded work; frontier models handle decision-class reasoning. Frontier-model share is a KPI, not a year-end finding.

The cost of the work

AGENTOS · ECONOMICS

Economics

Cost per unit of work, not monthly AI spend.

Most platforms report a vendor invoice. AgentOS reports the cost of the work.

Every dollar attributed to the task that spent it, rolled up to the work order, the phase, the role, and the model. Frontier-model share is a KPI, not a year-end finding.

Become a design partner See attribution

The compounding insight

Cheaper and smarter the longer it runs.

Three loops that bend the cost curve down and the capability curve up at the same time. None of them require retraining.

LOOP 01 · INFERENCE COST

Frontier spend trends down.

Every expensive explanation is captured once and reused forever. As the memory layer fills, local models absorb a growing share of routine work, and frontier models get reserved for high-leverage reasoning.

100% frontier→~24% frontier

LOOP 02 · DEVELOPER LEVERAGE

Review collapses to minutes.

Work arrives with its own evidence packet. Reviewers verify the gates and spot-check the diff instead of re-reading every line, so throughput per engineer compounds.

days re-reading→minutes to verify

LOOP 03 · INSTITUTIONAL INTELLIGENCE

The system gets smarter, no retraining.

Decisions, approved patterns, and failure modes accumulate. Tomorrow's agents inherit today's lessons, and that knowledge survives engineer turnover.

each agent blind→each agent grounded

Per unit of work

Most platforms report monthly AI spend. AgentOS reports the cost of the work.

Spend is a property of a task, not a line on a vendor invoice. Every dollar is attributed to the unit of work that spent it, all the way up the work graph.

Cost per work order Cost per role Cost per phase Cost per model Cost per action

Planning

Architecture

Development

Governance

Routing by task class

The right model for the right task.

Cheap local models handle the high-volume, class-bounded work. Frontier models handle the high-leverage, decision-class reasoning. AgentOS routes by task class and cost class — never by whoever shouted loudest.

Local lane

Bulk, grounded, class-bounded.

Most of the work is routine: grounding lookups, embeddings, classification, summarization, formatting. Local models on your own hardware do this work at near-zero marginal cost.

Inference Fabric: 22× tokens per NVIDIA GPU
Bulk grounding · embedding · classification
Class-bounded execution

Frontier lane

Reserved for decision-class work.

When a task requires open reasoning, a frontier model is the right tool. AgentOS reserves the lane for that, so spend goes where leverage is.

Architecture, design, ambiguous QA
Adversarial review and adjudication
Frontier-model share tracked as a KPI

Cost discipline at the boundary

Budgets enforced where the work happens.

The contract that authorizes a task also bounds it. Budget is a required field, fail-closed when exhausted, and visible in real time.

Sub-penny precision

Spend tracked per task.

Token usage and model rates roll up per task in near-real time. No batch reconciliation, no surprise overruns at month-end.

Fail-closed at the gate

Budget exhausted means stop.

A task that runs out of budget halts at the boundary and surfaces. It does not silently borrow from the next task, the next phase, or the next month.

Frontier-model KPI

Cheap by default, expensive on purpose.

Frontier share is a tracked number. If it climbs, that is a signal something is escaping the routing rules, not a quarterly surprise.