Platform
Health
Backend health surface: latency histogram, throughput, recent 5xx, alert rules, Slack webhook.
Features
- **API usage by endpoint + key** — top callers, slow endpoints, error rates per project.
- **Delivery health per channel** — push delivered / opened / failed, email bounced / opened / clicked, SMS delivered / failed.
- **Latency + error budgets** — p50 / p95 / p99 across modules.
- **AI anomaly alerts** via email / Slack with per-org webhook routing — picks up traffic dips, send-rate spikes, error-rate excursions.
- **Audit log** with actor / resource-type / date-range filters + expandable metadata viewer.
- **Top endpoints (24h) + webhook tester + SSRF-guarded outbound probe** — debug integrations safely.
- **Promote-failed + workflow-failed alerts** surface broken Automation runs before users notice.
Common use cases
- Engineering health dashboards for the Sendora-backed parts of your stack.
- Ops alerting on delivery degradation (sudden push-failure spike from Apple, email bounce-rate breach).
- Compliance audit trail — auditor-ready filterable log of every actor action.
Observability
Read health snapshot
FREEReal-time uptime + p99 latency + error rate. Polls the same source the operator dashboard renders.
curl "https://api.sendoracloud.com/api/v1/orgs/<ORG_ID>/observability/health" \
-H "x-api-key: pk_prod_…"Define an alert rule
GROWTH+Auto-route an alert to your Slack / PagerDuty when a metric breaches a threshold. AI anomaly-detector also runs continuously and can fire alerts on its own.
curl -X POST https://api.sendoracloud.com/api/v1/orgs/<ORG_ID>/observability/alert-rules \
-H "x-api-key: pk_prod_…" \
-d '{
"name": "p99 latency over 1s",
"metric": "latency_p99_ms",
"operator": "gt",
"threshold": 1000,
"windowMinutes": 5,
"channelType": "slack"
}'