honeyDueAPI

admin/honeyDueAPI

Fork 0

Files

T

History

Trey t 2004f9c5b2

Backend CI / Test (push) Has been cancelled

Details

Backend CI / Contract Tests (push) Has been cancelled

Details

Backend CI / Lint (push) Has been cancelled

Details

Backend CI / Secret Scanning (push) Has been cancelled

Details

Backend CI / Build (push) Has been cancelled

Details

fix(observability): relax vmagent liveness probe — was crash-looping every ~5m

The previous probe had timeoutSeconds=1 which is too tight for the
shell pipeline (sh + wget + grep + comparison). On a busy node the
wget call regularly exceeded 1s, the exec timed out, and 3 consecutive
timeouts triggered SIGTERM. Result: vmagent restarted ~5x per 30 min,
causing brief gaps that made the Grafana "Pods up" panel render 0
whenever a refresh happened to coincide with a restart.

The relaxed probe still catches the original failure mode (zero healthy
targets) but only kills the pod after 10 full minutes of consecutive
failure (5 attempts × 2 min period), not 3 minutes (3 × 1 min).

  timeoutSeconds: 1 → 5
  periodSeconds: 60 → 120
  failureThreshold: 3 → 5
  initialDelaySeconds: 120 → 180

Also added wget -T 4 inside the command so wget itself bounds its
network call to 4s — leaving 1s of slack within the 5s exec budget.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-13 00:39:23 -05:00

admin

Security hardening: TLS at origin, security headers, network policies, admin probe fix

2026-04-24 15:50:47 -05:00

api

Revert "deployment: extend api startup probe budget for direct-endpoint migrations"

2026-04-26 22:22:07 -05:00

ingress

deploy(ingress): drop obsolete scaffold ingress.yaml

2026-04-26 23:44:21 -05:00

migrate

Adopt pressly/goose for schema migrations