Skip to content

Codex Docs

Production Deployment Runbook

Aries-Serpent/_codex_

Production Deployment Runbook¶

Prerequisites¶

Kubernetes cluster with GPU node pool available
Helm 3.x installed locally
Docker registry credentials configured via docker login

Deployment Steps¶

Build image:

./scripts/deploy/orchestrate.sh build --gpu

Push to registry:
```
./scripts/deploy/orchestrate.sh push
```

Deploy Helm chart:

helm upgrade --install codex deploy/helm/

Verify health probes:
```
kubectl get pods -l app=codex
```

Run smoke tests:

./scripts/deploy/orchestrate.sh run --dry-run
pytest tests/deployment/test_api_integration.py -k health

Rollback Procedure¶

Identify last known good chart version using helm history codex.
Roll back:
```
helm rollback codex <revision>
```
Validate /ready endpoint until status returns 200.

Observability¶

Metric	SLO	Monitoring
Availability	99.9%	Prometheus + Alertmanager
Latency p95	< 200ms	Grafana dashboards
Error Rate	< 0.1%	Sentry alerts

Incident Response¶

Escalation and communication steps are documented in docs/security/incident_response.md.