Monitoring
Host and service visibility with a small footprint.
Why this matters
Monitoring is not there to look impressive. It exists so someone reviewing or operating the platform can answer
three questions quickly: Is the host healthy? Are the services reachable? Where should I look first?
Prometheus
Collects host and service metrics, with a deliberately short retention period and a cap on storage size.
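As a rough illustration of what short retention and a size limit can look like, the sketch below assembles Prometheus command-line arguments. The flags `--storage.tsdb.retention.time` and `--storage.tsdb.retention.size` are standard Prometheus options; the values `7d` and `1GB`, the config path, and the helper name `prometheus_args` are assumptions for illustration, not this platform's actual settings.

```python
def prometheus_args(retention_time="7d", retention_size="1GB"):
    """Build a Prometheus argument list with both retention limits set.

    The default values are hypothetical; Prometheus removes old data once
    either the time limit or the size limit is exceeded.
    """
    return [
        "--config.file=/etc/prometheus/prometheus.yml",  # assumed config path
        f"--storage.tsdb.retention.time={retention_time}",
        f"--storage.tsdb.retention.size={retention_size}",
    ]


if __name__ == "__main__":
    # Print the full command line, e.g. for a container "command:" entry.
    print(" ".join(["prometheus", *prometheus_args()]))
```

Setting both limits means whichever is reached first bounds disk usage, which suits a small-footprint host.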
Grafana
Shared UI for metrics and logs, protected behind Nginx rather than exposed directly.
Node Exporter
Provides host-level metrics for CPU, memory, disk, and filesystem behavior.
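To make the host-level metrics concrete, here is a minimal sketch that reads Node Exporter's text output and derives memory usage. The metric names `node_memory_MemTotal_bytes` and `node_memory_MemAvailable_bytes` are ones Node Exporter exposes on Linux; the helper names and the sample values are hypothetical, and the parser deliberately handles only unlabelled samples.

```python
def parse_unlabelled_samples(text):
    """Parse unlabelled samples from Prometheus text exposition format."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        name, _, rest = line.partition(" ")
        if "{" in name or not rest:
            continue  # labelled series are out of scope for this sketch
        samples[name] = float(rest.split()[0])  # ignore optional timestamp
    return samples


def memory_used_percent(samples):
    """Return the share of memory in use, based on MemAvailable."""
    total = samples["node_memory_MemTotal_bytes"]
    available = samples["node_memory_MemAvailable_bytes"]
    return 100.0 * (1.0 - available / total)


# Hypothetical excerpt of what a /metrics scrape might contain.
SAMPLE = """\
# HELP node_memory_MemTotal_bytes Memory information field MemTotal_bytes.
# TYPE node_memory_MemTotal_bytes gauge
node_memory_MemTotal_bytes 8e+09
node_memory_MemAvailable_bytes 2e+09
node_filesystem_avail_bytes{mountpoint="/"} 5e+10
"""

if __name__ == "__main__":
    print(memory_used_percent(parse_unlabelled_samples(SAMPLE)))
```

In practice Prometheus scrapes and stores these values itself; the sketch only shows what the raw data looks like before it reaches a dashboard.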
Uptime Kuma
Runs availability checks and serves a dedicated status page on its own hostname.
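The idea behind an availability check can be sketched in a few lines. This is not how Uptime Kuma is implemented; it is a simplified stand-in using only the Python standard library, and the function name `check_once`, the 5-second timeout, and the "2xx or 3xx means up" rule are assumptions.

```python
import time
import urllib.request


def check_once(url, fetch=urllib.request.urlopen, timeout=5.0):
    """Probe a URL once and report whether it answered successfully.

    `fetch` is injectable so the check can be exercised without a network.
    """
    started = time.monotonic()
    try:
        with fetch(url, timeout=timeout) as response:
            up = 200 <= response.status < 400
    except OSError:
        # Covers connection errors, timeouts, and urllib's HTTP errors,
        # all of which derive from OSError.
        up = False
    return {"url": url, "up": up, "seconds": time.monotonic() - started}
```

A real monitor repeats such a probe on an interval and records the history, which is what makes a status page possible.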
cAdvisor
Supplies container-level CPU and memory metrics so Grafana can show workload behavior instead of host metrics alone.
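As an illustration of the container-level view, the sketch below builds Prometheus HTTP API query URLs for per-container CPU and memory. The metric names `container_cpu_usage_seconds_total` and `container_memory_working_set_bytes` come from cAdvisor, and `/api/v1/query` is the Prometheus instant-query endpoint; the base URL, the 5-minute rate window, and grouping by the `name` label are assumptions about how a dashboard might be set up.

```python
from urllib.parse import urlencode

# PromQL expressions a Grafana panel might use (assumed, not taken from
# this platform's dashboards).
CPU_BY_CONTAINER = (
    'sum by (name) (rate(container_cpu_usage_seconds_total{name!=""}[5m]))'
)
MEMORY_BY_CONTAINER = (
    'sum by (name) (container_memory_working_set_bytes{name!=""})'
)


def instant_query_url(base_url, promql):
    """Return the URL for a Prometheus instant query."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


if __name__ == "__main__":
    # The address below is a placeholder for wherever Prometheus listens.
    for expr in (CPU_BY_CONTAINER, MEMORY_BY_CONTAINER):
        print(instant_query_url("http://localhost:9090", expr))
```

Filtering on `name!=""` is a common way to drop cgroup-level series that do not correspond to a named container.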
Alertmanager
Receives Prometheus alerts and keeps alert routing simple, with notification receivers kept intentionally minimal.
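A minimal routing setup might look like the sketch below, which describes an Alertmanager configuration as a Python dictionary and checks that the route points at a defined receiver. The keys `route`, `receiver`, `group_by`, and `receivers` are real Alertmanager configuration fields; the receiver name `default` and the helper `route_is_valid` are hypothetical, and the receiver is left without any notification integration to mirror the "intentionally minimal" description.

```python
import json

# Assumed minimal configuration: one catch-all route, one empty receiver.
MINIMAL_CONFIG = {
    "route": {
        "receiver": "default",
        "group_by": ["alertname"],
    },
    "receivers": [
        {"name": "default"},  # no integrations configured yet
    ],
}


def route_is_valid(config):
    """Check that the top-level route references a declared receiver."""
    declared = {receiver["name"] for receiver in config["receivers"]}
    return config["route"]["receiver"] in declared


if __name__ == "__main__":
    assert route_is_valid(MINIMAL_CONFIG)
    # JSON output is generally accepted where YAML is expected, so this can
    # serve as a starting point for an alertmanager.yml file.
    print(json.dumps(MINIMAL_CONFIG, indent=2))
```

With a single catch-all route, every alert lands in one place, and notification channels can be added to the receiver later without touching the routing.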