llm-gateway/sync/history/2026-05-12-gateway-health-check.md
2026-05-12 21:42:43 +02:00

3.7 KiB

Session: LLM Gateway Health Check — 2026-05-12

Agent: Codex Status: Partial success: gateway works, but not all desktop AI clients are captured.

Checks Performed

  • Read sync/CURRENT.md first and treated it as the binding handoff state.
  • Checked public gateway surfaces:
    • /api/dashboard/health
    • /v1/models
    • /v1/chat/completions
  • Queried dashboard-only endpoints using the dashboard token internally without printing it:
    • /api/dashboard/providers
    • /api/dashboard/subscriptions
    • /api/dashboard/requests
    • /api/dashboard/request-metrics
    • /api/dashboard/clients
  • Checked PM2 status on Erik.
  • Checked bridge health for Claude, OpenAI/ChatGPT, GitHub Copilot, Codex, and Microsoft 365 Copilot.

Verified Working

  • Gateway process is online in PM2.
  • Dashboard health returns ok.
  • Database is connected.
  • /v1/models returns the configured model list.
  • A live smoke request to /v1/chat/completions succeeded:
caller: codex-live-gateway-check
model: qwen2.5:14b
response: gateway-check-ok
tokens_in: 83
tokens_out: 4
latency_ms: 8363
  • The smoke request was immediately visible in dashboard request tracking.
  • Daily request metrics were available:
total_requests: 8
total_tokens: 4996
success_rate: 1
estimated_api_cost_avoided: 0.033817
compression_operations: 3
compression_tokens_saved: 25
top_model: qwen2.5:14b

Not Fully Working

  • The gateway is not currently capturing every desktop AI interaction.
  • Dashboard client detection showed:
codex-desktop: live, 2 tracked requests
claude-desktop: live, 3 tracked requests
microsoft-copilot: running, 0 gateway requests
github-copilot: running, 0 gateway requests
chatgpt: not-connected
openai-compatible: live, 1 tracked request
  • Codex and M365 bridge URLs are configured to MacStudio LAN addresses:
CODEX_BRIDGE_URL=http://192.168.178.213:3253
OPENAI_CODEX_URL=http://192.168.178.213:3253
M365_COPILOT_BRIDGE_URL=http://192.168.178.213:3257
  • Erik could not reach either MacStudio bridge during the check:
192.168.178.213:3253 unreachable
192.168.178.213:3257 unreachable
  • Local Mac checks also showed nothing listening on:
127.0.0.1:3253
127.0.0.1:3257
  • GitHub Copilot bridge on Erik is online but returns:
auth_required
  • Microsoft 365 Copilot bridge is configured/running but requires Microsoft Graph auth:
auth_required
Set MICROSOFT_CLIENT_ID or M365_COPILOT_ACCESS_TOKEN.

Security Decision

Codex attempted to restart the local Codex bridge on the MacStudio bound to 0.0.0.0:3253, but the action was rejected by policy because it would persistently expose local Codex subscription access to the LAN via PM2.

Do not work around this. Safer options:

  • Start a local bridge bound only to 127.0.0.1 for local-only tests.
  • Use an authenticated/restricted tunnel between Erik and MacStudio.
  • Bind to LAN only after explicit user approval and a narrow firewall/source-IP rule.

Conclusion

LLM Gateway itself works and tracks requests that pass through it. It does not currently "take everything" because several desktop/subscription clients are either not routed through the gateway, not authenticated, or their MacStudio bridges are unreachable.

Next Actions

  1. Decide the safe connectivity model for MacStudio bridges:
    • restricted LAN bind,
    • SSH tunnel,
    • VPN-only route,
    • or local-only.
  2. Re-authenticate GitHub Copilot bridge.
  3. Provide Microsoft Graph app/token for M365 Copilot if that bridge should become functional.
  4. Configure actual desktop clients to call https://llm-gateway.context-x.org/v1 if their tokens should be counted and compressed.
  5. Run another smoke test after bridge connectivity is restored.