Operations
Standard Procedures
The operating procedures every team member follows. They make outcomes reproducible, auditable, and safe.
SOP-001 — Development workflow
- Pick a ticket from the current sprint; move it to In progress.
- Create a branch
feat/<ticket-id>-short-slug. - Write tests first when modifying domain logic; cover edges.
- Open a draft PR early; CI must pass (lint, typecheck, unit, e2e on critical paths).
- Request review from a domain owner; at least one approval required.
- Squash-merge with a Conventional Commit message; preview deploys auto-publish.
SOP-002 — Release
- Releases ship every weekday at 14:00 UTC; hotfixes any time.
- Release captain rotates weekly; they own the release notes and rollback decision.
- Tag
vYYYY.MM.DD.N; staging soaks for 30 min before promotion. - Production deploy is gated by a green synthetic monitor run.
- Post-release: monitor RED metrics dashboard for 30 min; close release in changelog.
SOP-003 — Incident response
- Anyone can declare an incident in
#incidentswith severity. - Incident commander assigned within 5 min for SEV-1 / SEV-2.
- Communications lead posts customer updates every 30 min on the status page.
- Mitigate first, fix later; rollback is always an acceptable first step.
- Postmortem within 5 business days; blameless, action items tracked to completion.
| Severity | Definition | Response |
|---|---|---|
| SEV-1 | Platform down or data loss | Page on-call, all-hands |
| SEV-2 | Major feature broken for > 5 % users | Page on-call |
| SEV-3 | Degraded, workaround exists | Next business day |
SOP-004 — Data change management
- Production data changes require a ticket and a peer-reviewed migration script.
- Run on staging first; capture before/after row counts.
- Apply with a transaction; have rollback SQL ready.
- Log the change in the data-ops journal with diff and approver.
SOP-005 — MRV sign-off
- Measurements are imported and auto-enriched with satellite layers.
- Independent verifier reviews evidence; rejects with reason or approves.
- Approved batches are hash-stamped; the hash is anchored to the audit log.
- Registry export reviewed by methodology owner before pushing.
- Issuance recorded in marketplace inventory only after registry confirms.
SOP-006 — Access management
- Access is granted by role, not by individual permissions.
- New hires onboarded via the access-request workflow; manager approves.
- Quarterly access review by tenant admin; revocations applied within 24 h.
- Offboarding triggers immediate session invalidation and device wipe.
SOP-007 — Backups & disaster recovery
- Continuous WAL shipping to a separate region.
- Nightly snapshots retained for 35 days; monthly for 12 months.
- Quarterly restore drill against an isolated environment; result published internally.