12 - Performance Workbook

Purpose – define repeatable, data‑driven benchmarks that guard Stella Ops’ core pledge:

“P95 vulnerability feedback in ≤ 5 seconds.”


0 Benchmark Scope

| Area | Included | Excluded |
|------|----------|----------|
| SBOM‑first scan | Trivy engine w/ warmed DB | Full image unpack ≥ 300 MB |
| Delta SBOM ⭑ | Missing‑layer lookup & merge | Multi‑arch images |
| Policy eval ⭑ | YAML → JSON → rule match | Rego (until GA) |
| Feed merge | NVD JSON 2023–2025 | GHSA GraphQL (plugin) |
| Quota wait‑path | 5 s soft‑wait, 60 s hard‑wait behaviour | Paid tiers (unlimited) |
| API latency | REST /scan, /layers/missing | UI SPA calls |

⭑ = new in July 2025.


1 Hardware Baseline (Reference Rig)

| Element | Spec |
|---------|------|
| CPU | 8 vCPU (Intel Ice‑Lake equiv.) |
| Memory | 16 GiB |
| Disk | NVMe SSD, 3 GB/s R/W |
| Network | 1 Gbit virt. switch |
| Container | Docker 25.0 + overlay2 |
| OS | Ubuntu 22.04 LTS (kernel 6.8) |

All P95 targets assume a single‑node deployment on this rig unless stated.


2 Phase Targets & Gates

| Phase (ID) | Target P95 | Gate (CI) | Rationale |
|------------|------------|-----------|-----------|
| SBOM_FIRST | ≤ 5 s | hard | Core UX promise. |
| IMAGE_UNPACK | ≤ 10 s | soft | Fallback path for legacy flows. |
| DELTA_SBOM | ≤ 1 s | hard | Needed to stay sub‑5 s for big bases. |
| POLICY_EVAL | ≤ 50 ms | hard | Keeps gate latency invisible to users. |
| QUOTA_WAIT | soft ≤ 5 s, hard ≤ 60 s | hard | Ensures graceful Free‑tier throttling. |
| SCHED_RESCAN | ≤ 30 s | soft | Nightly batch – not user‑facing. |
| FEED_MERGE | ≤ 60 s | soft | Off‑peak cron @ 01:00. |
| API_P95 | ≤ 200 ms | hard | UI snappiness. |

Gate legend — hard: break CI if regression > 3 × target,
soft: raise warning & issue ticket.
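The legend can be read as a small classifier over measured P95 values. A minimal sketch under that reading (the `check_gate` helper and the exact pass/warn/fail boundaries are assumptions, not the real CI script):

```python
def check_gate(p95: float, target: float, gate: str) -> str:
    """Classify a measured P95 against its phase target.

    Reading of the gate legend assumed here:
      * hard gate: break CI when the measurement exceeds 3x the target,
        otherwise warn on any miss;
      * soft gate: never break CI, warn (and file a ticket) on any miss.
    """
    if p95 <= target:
        return "pass"
    if gate == "hard" and p95 > 3 * target:
        return "fail"   # break CI
    return "warn"       # raise warning & issue ticket

# Example: API_P95 measured at 0.143 s against its 0.2 s hard target.
print(check_gate(0.143, 0.200, "hard"))  # pass
```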


3 Test Harness

  • Runner – perf/run.sh, accepts --phase and --samples.
  • Metrics – Prometheus + jq extracts; aggregated via scripts/aggregate.ts.
  • CI – GitLab CI job benchmark publishes JSON to bench‑artifacts/.
  • Visualisation – Grafana dashboard Stella‑Perf (provisioned JSON).

Note – harness mounts /var/cache/trivy tmpfs to avoid disk noise.
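The aggregation step boils raw sample timings down to the percentiles reported in section 4. A minimal Python equivalent of the reduction `scripts/aggregate.ts` performs (nearest‑rank percentile is an assumed choice, not necessarily the script's exact method):

```python
import math

def p95(samples: list[float]) -> float:
    """Nearest-rank P95: smallest sample >= 95 % of all samples."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = math.ceil(0.95 * len(ordered))  # 1-based nearest rank
    return ordered[rank - 1]

# 100 API samples: 94 fast calls, 6 slow ones.
times = [0.087] * 94 + [0.143] * 6
print(p95(times))  # 0.143
```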


4 Current Results (July 2025)

| Phase | Samples | Mean (s) | P95 (s) | Target OK? |
|-------|---------|----------|---------|------------|
| SBOM_FIRST | 100 | 3.7 | 4.9 | ✅ |
| IMAGE_UNPACK | 50 | 6.4 | 9.2 | ✅ |
| DELTA_SBOM | 100 | 0.46 | 0.83 | ✅ |
| POLICY_EVAL | 1 000 | 0.021 | 0.041 | ✅ |
| QUOTA_WAIT | 80 | 4.0* | 4.9* | ✅ |
| SCHED_RESCAN | 10 | 18.3 | 24.9 | ✅ |
| FEED_MERGE | 3 | 38.1 | 41.0 | ✅ |
| API_P95 | 20 000 | 0.087 | 0.143 | ✅ |

Data files: bench-artifacts/2025‑07‑14/phase‑stats.json.


5 Δ‑SBOM Micro‑Benchmark Detail

5.1 Scenario

  1. Base image python:3.12-slim already scanned (all layers cached).
  2. Application layer (COPY . /app) triggers new digest.
  3. Stella CLI lists 7 layers, backend replies 6 hit, 1 miss.
  4. Builder scans only 1 layer (~9 MiB, 217 files) & uploads delta.
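Steps 2–4 hinge on the missing‑layer exchange, which is essentially a set difference over layer digests. A sketch (function name and digest format are illustrative, not the real backend API):

```python
def missing_layers(image_layers: list[str], cached: set[str]) -> list[str]:
    """Return the layer digests the backend has never seen, in image order.

    Mirrors the /layers/missing exchange: the CLI sends every digest,
    the backend answers with the subset it cannot serve from cache.
    """
    return [d for d in image_layers if d not in cached]

# 7 layers, 6 cache hits, 1 miss -- the Delta SBOM scenario above.
layers = [f"sha256:{i:02d}" for i in range(7)]
cache = set(layers[:-1])              # application layer not yet scanned
print(missing_layers(layers, cache))  # only the new app layer remains
```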

5.2 Key Timings

| Step | Time (ms) |
|------|-----------|
| /layers/missing | 13 |
| Trivy single layer | 655 |
| Upload delta blob | 88 |
| Backend merge + CVE | 74 |
| **Total wall‑time** | **830** |

6 Quota Wait‑Path Benchmark Detail

6.1 Scenario

  1. Free‑tier token reaches scan #200 – dashboard shows yellow banner.

6.2 Key Timings

| Step | Time (ms) |
|------|-----------|
| /quota/check Redis Lua INCR | 0.8 |
| Soft‑wait sleep (server) | 5 000 |
| Hard‑wait sleep (server) | 60 000 |
| End‑to‑end wall‑time (soft‑hit) | 5 003 |
| End‑to‑end wall‑time (hard‑hit) | 60 004 |
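The timings above follow from a check‑then‑sleep shape on the server. A sketch of that decision (the soft limit of 200 scans comes from the scenario; the hard‑wait cut‑over is an illustrative assumption, not a documented threshold):

```python
SOFT_LIMIT = 200    # scan count where soft-wait starts (per the scenario)
HARD_LIMIT = 300    # illustrative only -- the real cut-over is not given here
SOFT_WAIT_S = 5.0
HARD_WAIT_S = 60.0

def wait_for(scan_count: int) -> float:
    """Server-side delay (seconds) applied before scan `scan_count` runs."""
    if scan_count > HARD_LIMIT:
        return HARD_WAIT_S
    if scan_count > SOFT_LIMIT:
        return SOFT_WAIT_S
    return 0.0

print(wait_for(150), wait_for(201), wait_for(301))  # 0.0 5.0 60.0
```

The sub‑millisecond Redis Lua INCR plus the fixed sleep accounts for the ~5 003 ms and ~60 004 ms end‑to‑end figures in the table.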

7 Policy Eval Bench

7.1 Setup

  • Policy YAML: 28 rules, a mix of severity and package conditions.
  • Input: scan result JSON with 1 026 findings.
  • Evaluator: custom rules engine (Go structs → map look‑ups).

7.2 Latency Histogram

0‑10 ms  ▇▇▇▇▇▇▇▇▇▇  38 %
10‑20 ms ▇▇▇▇▇▇▇▇▇▇  42 %
20‑40 ms ▇▇▇▇▇▇     17 %
40‑50 ms ▇           3 %

P99 = 48 ms. Meets 50 ms gate.


8 Trend Snapshot

Perf trend spark‑line placeholder

Plot generated weekly by scripts/update‑trend.py; shows last 12 weeks P95 per phase.
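A sketch of the weekly roll‑up `scripts/update‑trend.py` needs to produce before plotting (the row layout and key names are assumptions):

```python
from collections import defaultdict

def trend(rows: list[tuple[str, str, float]],
          weeks: int = 12) -> dict[str, list[float]]:
    """Group (iso_week, phase, p95) rows into the last `weeks` points
    per phase, ordered oldest to newest -- one spark-line per phase."""
    by_phase: dict[str, list[tuple[str, float]]] = defaultdict(list)
    for week, phase, p95 in rows:
        by_phase[phase].append((week, p95))
    return {
        phase: [v for _, v in sorted(points)[-weeks:]]
        for phase, points in by_phase.items()
    }

rows = [
    ("2025-W27", "SBOM_FIRST", 5.1),
    ("2025-W28", "SBOM_FIRST", 4.9),
    ("2025-W28", "POLICY_EVAL", 0.041),
]
print(trend(rows))
```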


9 Action Items

  1. Image Unpack – Evaluate zstd for layer decompress; aim to shave 1 s.
  2. Feed Merge – Parallelise regional XML feed parse (plugin) once stable.
  3. Rego Support – Prototype OPA side‑car; target ≤ 100 ms eval.
  4. Concurrency – Stress‑test 100 rps on 4‑node Redis cluster (Q4‑2025).

10 Change Log

| Date | Note |
|------|------|
| 2025‑07‑14 | Added Δ‑SBOM & Policy Eval phases; updated targets & current results. |
| 2025‑07‑12 | First public workbook (SBOM‑first, image‑unpack, feed merge). |