Implement complete monitoring infrastructure following rick-infra principles: Components: - VictoriaMetrics: Prometheus-compatible TSDB (7x less RAM usage) - Grafana: Visualization dashboard with Authentik OAuth/OIDC integration - node_exporter: System metrics collection (CPU, memory, disk, network) Architecture: - All services run as native systemd binaries (no containers) - localhost-only binding for security - Grafana uses native OAuth integration with Authentik (not forward_auth) - Full systemd security hardening enabled - Proxied via Caddy at metrics.jnss.me with HTTPS Role Features: - Unified metrics role (single role for complete stack) - Automatic role mapping via Authentik groups: - authentik Admins OR grafana-admins -> Admin access - grafana-editors -> Editor access - All others -> Viewer access - VictoriaMetrics auto-provisioned as default Grafana datasource - 12-month metrics retention by default - Comprehensive documentation included Security: - OAuth/OIDC SSO via Authentik - All metrics services bind to 127.0.0.1 only - systemd hardening (NoNewPrivileges, ProtectSystem, etc.) - Grafana accessible only via Caddy HTTPS proxy Documentation: - roles/metrics/README.md: Complete role documentation - docs/metrics-deployment-guide.md: Step-by-step deployment guide Configuration: - Updated rick-infra.yml to include metrics deployment - Grafana port set to 3001 (Gitea uses 3000) - Ready for multi-host expansion (designed for future node_exporter deployment to production hosts)
23 lines
721 B
Django/Jinja
23 lines
721 B
Django/Jinja
global:
|
|
scrape_interval: {{ victoriametrics_scrape_interval }}
|
|
scrape_timeout: {{ victoriametrics_scrape_timeout }}
|
|
external_labels:
|
|
environment: '{{ "homelab" if inventory_hostname in groups["homelab"] else "production" }}'
|
|
host: '{{ inventory_hostname }}'
|
|
|
|
scrape_configs:
|
|
# VictoriaMetrics self-monitoring
|
|
- job_name: 'victoriametrics'
|
|
static_configs:
|
|
- targets: ['{{ victoriametrics_listen_address }}']
|
|
labels:
|
|
service: 'victoriametrics'
|
|
|
|
# Node exporter for system metrics
|
|
- job_name: 'node'
|
|
static_configs:
|
|
- targets: ['{{ node_exporter_listen_address }}']
|
|
labels:
|
|
service: 'node_exporter'
|
|
instance: '{{ inventory_hostname }}'
|