Spokes.wiki Search Graph Growth About

platform-ops-wiki

Log — Platform Ops Wiki

Append-only history. Each entry starts with ## [YYYY-MM-DD] <op> | <title> where <op> is ingest, query, lint, or split, so grep "^## \[" log.md | tail -5 works.

[2026-06-05] split | platform-ops-wiki created from _inbox cluster (3 sources)

Spun out by the hub router when the InfoQ piece netflix-service-topology arrived via Telegram — the third tight ops piece, hitting the ≥3 spin-out threshold for the platform-ops-sre cluster (the google-sre-agentic-ai park note had explicitly flagged that a 3rd would trigger a spin-out). Scaffolded from CLAUDE.template.md; domain = production platform engineering, SRE & observability for cloud-native distributed systems. Migrated and ingested all three (URL-only, source: true + url:):

[2026-06-05] lint | first health check (10 pages, day-of-spin-out)

Swept the new spoke for orphans, thin spots, @type specificity, missing cross-links, and contradictions. Findings + actions:

[2026-06-09] ingest | +3 observability substrate (OpenTelemetry, eBPF, Prometheus) — all-spokes cron test

Answered the “build vs buy the topology” + “eBPF operational cost” open questions with the off-the-shelf CNCF stack: opentelemetry (SoftwareApplication, src — vendor-neutral traces/metrics/logs standard, not a backend), ebpf (DefinedTerm, src — sandboxed in-kernel programs; verifier/JIT/maps; CAP_BPF + complexity + kernel-version costs), prometheus (SoftwareApplication, src — pull-based time series + PromQL; 2nd CNCF project; not billing-grade). Tied to netflix-service-topology/kubernetes-integration-tax. Synthesis open questions updated (gap remaining: the topology-graph assembly above the raw signals). url-only. 10 → 13 pages.

[2026-06-10] ingest | SLOs + GitOps + distributed tracing — all-spokes pass (one foundation per pillar)

Three foundational concepts the spoke referenced but never paged. service-level-objectives (DefinedTerm, source, Google SRE book) — SLI/SLO/SLA + error budgets; the quantification backbone the open questions wanted (reliability-vs-velocity as a measured control loop; toil/MTTR become budget math), and a reframe of the AIOps reliability paradox (agents under an error budget). gitops (DefinedTerm, source, OpenGitOps/CNCF) — Git as single source of truth; four principles (declarative, versioned-immutable, pulled, continuously reconciled; Argo CD/Flux); the deployment face of “seams, not components” and a structural cousin of the aiops control loop. distributed-tracing (DefinedTerm, source, OpenTelemetry) — spans/traces/context-propagation; the per-request view of the service-topology (topology ≈ traces summed over time), the third signal beside metrics/logs, one of Netflix’s three fused telemetry sources, and the source of latency SLIs. Together they close a loop: observability(tracing) → SLIs/SLOs → reconcile/operate(GitOps/AIOps). Folded into synthesis (new 2026-06-10 section) + index (3 DefinedTerm rows). No contradictions. 13 → 16 pages.

[2026-06-12] ingest | DORA metrics (Four Keys) — dora.dev

All-spokes daily expansion. Added dora-metrics (@type DefinedTerm) — the delivery-performance quantification framework completing the “Quantification” open question that service-level-objectives half-answered. SLOs measure the running service’s reliability; DORA measures the delivery pipeline: throughput (deploy frequency, change lead time) + stability (change fail rate, failed-deployment recovery time, deployment rework rate). Captured the “speed and stability are not tradeoffs” finding and the MTTR→“Failed Deployment Recovery Time” term shift. Wired to service-level-objectives (backlink) / gitops / aiops (gives the reliability-paradox a yardstick); synthesis note added; open question reframed (frameworks named, still want them applied to this spoke’s own MTTR/toil claims). 1 new page. Authoritative (Google DORA / State of DevOps). No contradictions.

[2026-06-12] ingest | Project-as-a-Service (Belastingdienst, InfoQ/KubeCon)

Telegram drop, routed → platform-ops-wiki (platform-engineering pillar). Added source project-as-a-service