Skip to main content

Reading Optimisation Reports

The optimisation suite monitors all 8 agents for drift, consistency, and effectiveness. Reports post to #agent-optimisation on Discord. Here’s how to read them.

Report Types

Universal Reports (U-series)

These run for every agent and check foundational health metrics.
ReportFrequencyWhat It Checks
U-01: Prompt DriftWeekly (Sun)Has the agent’s behaviour drifted from its original spec?
U-02: Memory HygieneDailyIs the agent’s memory clean and well-organised?
U-03: Voice ConsistencyFortnightlyDoes the agent still sound like itself?
U-04: Context BudgetMonthlyIs the agent using its context window efficiently?
U-05: ADHD ProtocolWeekly (Sun)Are ADHD protocols being activated and working?
U-06: Council HandoffWeekly (Sun)Are agent-to-agent consultations working smoothly?
U-07: Tool RelevanceMonthlyAre the agent’s tools still appropriate?
U-08: Regression TestingMonthlyHas response quality degraded?
U-09: Personality EntropyMonthlyIs the agent’s personality stable or drifting?
U-10: Config DriftQuarterlyHas the agent’s config diverged from the canonical spec?

Group Reports (G-series)

These check how groups of agents work together.
ReportAgentsWhat It Checks
G-01: Shipping VelocityAnvil + SentinelAre we building and deploying at a healthy pace?
G-02: Decision QualityCompass + Atlas + VaultAre strategic decisions being made well?
G-03: Growth & WellbeingTempo + CortexIs learning happening without burnout?
G-04: Content QualityBard + CortexIs content accurate and on-brand?

Individual Reports (I-series)

Deep dives for specific agents with unique concerns.
ReportAgentWhat It Checks
I-01: Debt/Shipping RatioAnvilBalance between new features and tech debt
I-02: AuthenticityBardIs content authentic vs. generic AI slop?
I-03: Priority AccuracyAtlasWere yesterday’s priority calls correct in hindsight?

How to Read a Report

Reports in Discord follow a consistent format:
📊 OPTIMISATION: U-01 Prompt Drift — Anvil 🛠️
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Status: ✅ OK  |  Score: 0.91  |  Trend: stable

Dimensions:
  Tone:        0.93 ✅  (within bounds)
  Scope:       0.88 ✅  (within bounds)
  Protocol:    0.95 ✅  (strong)
  Consistency: 0.89 ✅  (within bounds)

Recommendations: None — Anvil is operating within spec.

Cost: $0.03  |  Tokens: 1,247

Status Meanings

StatusColourMeaningAction Needed
OKGreenEverything normalNone
WARNINGYellowSomething is drifting but not criticalKeep an eye on it
ACTION_REQUIREDRedNeeds human attentionRead recommendations and act
SKIPPEDGreyRate-limited or cost-cappedWill retry next cycle

Score Ranges

  • 0.90 - 1.00 — Excellent, operating within spec
  • 0.75 - 0.89 — Good, minor drift detected
  • 0.50 - 0.74 — Warning zone, review recommendations
  • Below 0.50 — Critical, likely needs manual intervention

Trend Indicators

  • improving — Score is going up over recent runs
  • stable — Score is consistent
  • worsening — Score is declining — pay attention

What to Do When You See ACTION_REQUIRED

1

Read the recommendations

The report includes specific suggestions. Most are straightforward: “update the agent’s memory”, “review the prompt for scope creep”, etc.
2

Check the Airtable history

Open the “Optimisation Runs” table in Airtable to see the trend. Is this a one-off or a pattern?
3

Flag to the team

If you’re not sure what to do, share the Discord message in your team channel. The platform team can investigate.

Cost Tracking

Every report shows its cost in USD. The system has built-in limits:
LimitDefaultWhat Happens When Hit
Daily cost cap$5.00Remaining runs skip until tomorrow
Hourly rate limit50 callsRuns queue until next hour
Per-agent cost cap$1.00/dayThat agent’s runs skip
If you see lots of “SKIPPED” reports, the cost cap has been hit. This is by design — it prevents runaway API costs.
The cost of the full suite running at default schedules is approximately $2-3/day. If you see costs consistently higher, something may be running more often than expected.