Agentra LabsAgentra Labs DocsPublic Documentation

AgenticContract

Benchmarks

Performance benchmarks for AgenticContract

All benchmarks measured with Criterion on Apple M-series (ARM64) hardware. Results represent median values from 100+ iterations with outlier detection enabled.

Test Environment

PropertyValue
HardwareApple M-series (ARM64)
Rust1.75+
FrameworkCriterion 0.5
OSmacOS 14+
Profilerelease (optimized)

Summary Results

At typical workloads, all core governance operations complete in under 1 microsecond. File I/O for 100 entities takes under 200 microseconds.

CategoryRepresentativeLatency
Policy evaluationSingle policy check49.3 ns
Risk limit checkSingle limit43.9 ns
Violation reportingWith severity1.01 us
Engine statisticsFull summary60.4 ns
File save100 entities154.5 us
File load100 entities96.7 us

Detailed Results: Core Operations

Single-operation benchmarks measure the cost of one governance primitive call.

OperationMedianNotes
policy_evaluate_single49.3 nsEvaluate one policy against one action
risk_limit_check43.9 nsCheck one limit against proposed amount
violation_report1.01 usCreate and store a violation record
engine_stats60.4 nsCompute summary statistics

Detailed Results: Scale Operations

Policy evaluation scales linearly with policy count.

OperationMedianNotes
policy_evaluate_scale/10684.5 nsEvaluate action against 10 policies
policy_evaluate_scale/1007.04 usEvaluate action against 100 policies
policy_evaluate_scale/100072.8 usEvaluate action against 1,000 policies

Scaling Analysis

Policy CountLatencyPer-Policy Cost
149.3 ns49.3 ns
10684.5 ns68.5 ns
1007.04 us70.4 ns
1,00072.8 us72.8 ns

The per-policy cost is essentially constant at ~70 ns, confirming O(n) linear scaling with minimal overhead.

Detailed Results: File I/O

Binary serialization benchmarks measure .acon file write and read performance.

OperationMedianNotes
file_save_100_entities154.5 usWrite 100 mixed entities to disk
file_load_100_entities96.7 usRead + BLAKE3 verify from disk

I/O Scaling Estimates

Entity CountSaveLoadFile Size
1~5 us~3 us~512 B
100154.5 us96.7 us~50 KB
1,000~1.5 ms~1 ms~500 KB
10,000~15 ms~10 ms~5 MB

Load includes BLAKE3 checksum verification. Save includes checksum computation and header update.

Comparison Context

AgenticContract is a local-first policy engine optimized for single-agent governance. It is not a distributed policy server.

SystemArchitectureTypical LatencyStorage
AgenticContractSingle-file binary, in-process49 ns (eval).acon file
OPA (Open Policy Agent)HTTP server, Rego evaluation0.5-5 msIn-memory
Cedar (AWS)Library, Cedar language1-10 usIn-memory

AgenticContract is 10-100x faster than OPA for policy evaluation because there is no network round-trip or language interpretation. Compared to Cedar, AgenticContract trades policy language expressiveness for simpler text-matching semantics.

Performance Tiers

TierLatency RangeOperations
Interactive< 1 usPolicy eval, risk check, stats
Batch1-100 usScale evaluation, violation report
I/O100 us - 15 msFile save/load (scales with size)

All interactive operations are suitable for hot-path enforcement (e.g., checking every API call against policies).

Memory Usage

The engine holds all entities in memory. Approximate memory footprint:

Entity CountHeap Usage
100~50 KB
1,000~500 KB
10,000~5 MB

For typical agent governance (10-100 active policies, a few dozen limits and obligations), memory usage is negligible.

Reproducing Benchmarks

Prerequisites

  • Rust 1.75+
  • Criterion 0.5 (included in dev-dependencies)

Running All Benchmarks

cd crates/agentic-contract
cargo bench

Running Specific Benchmarks

cargo bench -- policy_evaluate
cargo bench -- file_save
cargo bench -- risk_limit

Viewing Reports

Criterion generates HTML reports in target/criterion/. Open report/index.html for interactive charts.

open target/criterion/report/index.html

Custom Configurations

Set environment variables to adjust benchmark parameters:

BENCH_POLICY_COUNT=5000 cargo bench -- policy_evaluate_scale