Overview
Scrape each Velaxe app's /metrics endpoint, roll up health into InsightLake, and alert on SLO violations.
Problem
Ops teams hop across apps to understand reliability; no single health view exists.
Solution
InsightLake ingests Prometheus metrics per app and exposes a workspace health board with alerting to PagerDuty/Slack/Email.
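As an illustration of the per-app roll-up, here is a minimal sketch that parses Prometheus text exposition output and derives one success rate per app. It is not InsightLake's actual ingestion code. The metric names `requests_total` and `request_errors_total` and the app names are hypothetical, and the parser is simplified: it ignores timestamps and does not handle `}` inside label values.

```python
import re
from collections import defaultdict

# One sample line: metric name, optional {labels}, then the value.
# Simplification: label values containing "}" are not supported.
SAMPLE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(\{[^}]*\})?\s+(\S+)')

def parse_metrics(text):
    """Sum sample values per metric name, across all label sets."""
    totals = defaultdict(float)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks and HELP/TYPE comments
            continue
        match = SAMPLE_RE.match(line)
        if match:
            totals[match.group(1)] += float(match.group(3))
    return dict(totals)

def success_rate(totals):
    """Return 1 - errors/requests, or None when there is no traffic.

    The two metric names are hypothetical placeholders.
    """
    total = totals.get("requests_total", 0.0)
    failed = totals.get("request_errors_total", 0.0)
    return None if total == 0 else 1.0 - failed / total

def roll_up(per_app_text):
    """Map each app name to its success rate for a single health view."""
    return {app: success_rate(parse_metrics(text))
            for app, text in per_app_text.items()}
```

Calling `roll_up` with a dict of app name to scraped /metrics text returns one number per app, which is the kind of value a workspace health board could display side by side.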
How it works
Enable /metrics ingestion, import the starter Grafana dashboard JSON, and set alert rules (e.g., on dashboard time-to-first-byte (TTFB) p95 or ingestion lag).
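To make the alert-rule step concrete, the sketch below estimates a p95 from cumulative histogram buckets using linear interpolation (the same general approach as Prometheus's `histogram_quantile`) and compares it to a threshold. The bucket data, the rule names, and the 60-second lag threshold are illustrative assumptions. The 2.5-second TTFB threshold is the target listed under Key metrics. This is not the product's rule engine.

```python
import math

def histogram_quantile(q, buckets):
    """Estimate quantile q from (upper_bound, cumulative_count) pairs.

    Buckets must be sorted by upper bound and end with a +Inf bucket.
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    total = buckets[-1][1]
    if total == 0:
        return None  # no observations, nothing to estimate
    rank = q * total
    prev_bound, prev_count = 0.0, 0.0
    for bound, count in buckets:
        if count >= rank:
            if math.isinf(bound):
                # Quantile falls in the +Inf bucket: report the last finite bound.
                return prev_bound
            # Interpolate linearly inside the bucket that contains the rank.
            fraction = (rank - prev_count) / (count - prev_count)
            return prev_bound + (bound - prev_bound) * fraction
        prev_bound, prev_count = bound, count
    return prev_bound

def evaluate_rule(name, value, threshold):
    """A rule fires when the observed value exceeds its threshold."""
    firing = value is not None and value > threshold
    return {"rule": name, "value": value, "firing": firing}

# Hypothetical TTFB histogram: (upper bound in seconds, cumulative count).
ttfb_buckets = [(1.0, 50), (2.5, 90), (5.0, 98), (math.inf, 100)]
p95 = histogram_quantile(0.95, ttfb_buckets)

results = [
    evaluate_rule("dashboard_ttfb_p95_seconds", p95, 2.5),
    evaluate_rule("ingestion_lag_seconds", 12.0, 60.0),  # assumed threshold
]
```

With these sample buckets the estimated p95 is above the 2.5-second target, so the first rule fires and the second does not. A firing result is what would be routed to PagerDuty, Slack, or email.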
Who is this for
- SRE/Ops
- Engineering
Expected outcomes
- Faster detection & resolution of incidents
- Shared reliability goals across teams
Key metrics
- SLO violations detected: baseline 2 per month, target 0 per month
- Dashboard TTFB p95: baseline 4 seconds, target 2.5 seconds
Case studies
PlatformOps centralizes monitoring
One board to see ingestion lag, query latency, and success rates.
Software · Mid-market · NA
Security impact
- Aggregated counters/gauges from application /metrics · PII: none
Compliance
- SOC 2 (monitoring & alerting)
Availability & next steps
Free
Pro
Enterprise