Engineering Evidence
Every claim on this site is backed by engineering evidence.
Not benchmarks. Not marketing. Artifacts.
9 capabilities · real artifacts · reproducible commands · source-of-truth links
Every claim on this site is backed by engineering evidence.
Not benchmarks. Not marketing. Artifacts.
9 capabilities · real artifacts · reproducible commands · source-of-truth links
Validated against the production runtime. The benchmark intentionally exceeded expected production behavior by driving individual retrieval surfaces at full concurrency.
Measured on production workloads, not synthetic benchmarks. 22× stock TensorRT-LLM throughput.
State survives the worker. The resume packet is regenerated from PostgreSQL on every stand-down.
Gates don't pass on assertions. They pass on files. Every claim is a path on disk.
Before any code change ships, the graph tells you the exact blast radius. Sub-second, no LLM.
Ground decisions in code, not model recall. Every hit returns file path, line range, and score.
Same PostgreSQL state regenerates the same packet bytes. Determinism is a property of the bytes.
Cost is a field on every task, not a line on an invoice. Sub-penny precision per call.
Every call routed by required task_type and schema_id. No ungoverned dispatches.
These nine capabilities are the foundation. More evidence entries land as the system grows. If you want to put your own work through AgentOS, become a design partner.