How Kubestead works
A Kubernetes operator that sits in your cluster, reads metrics from Prometheus or Datadog, and makes canary advancement and rollback decisions automatically. It is not a proxy, not a sidecar, and not in your request path — it manages replica counts and watches your existing metrics endpoints.
The architecture
Install (5 min)
kubectl apply -f kubestead-controller.yaml — no sidecar, no additional agent. Kubestead reads existing Kubernetes events and your metrics endpoint.
Write a RolloutPolicy
A single CRD in your repo defines canary steps, metric sources, thresholds, and rollback behavior. Everything is code-reviewed before it affects production.
Canary traffic splitting
Kubestead manages replica counts to split traffic at configurable percentages — no service mesh required. When you need sub-percentage canary slices, weighted routing via Istio, Linkerd, or Envoy Gateway is supported. Works with any ingress: nginx, Istio, Envoy Gateway.
Metric evaluation window
After each step, Kubestead waits a configurable soak time (default 5 min) and queries your metrics. Pass → advance to next step. Fail → instant rollback, rollback report generated automatically.
Audit trail
Every canary decision — step advancement, threshold breach, rollback trigger, or manual override — is written to a structured, tamper-evident JSON event log. The log includes the exact metric query result and timestamp at the moment of the decision. Exportable via webhook to your SIEM. Available in all paid plans.
Common questions
pause instead — meaning the canary halts at the current step and waits for metrics to recover — by setting onMetricUnavailable: pause in your RolloutPolicy.