llm-gateway

Author	SHA1	Message	Date
Rene Fichtmueller	2ca77d0aee	feat: Phase 2F — Multi-Agent Integration (ADRs + Client Fallback + Tests) - ADR-0001: Multi-Agent Coworking Architecture with LLM Gateway Orchestrator - ADR-0002: Tier Assignment Strategy for Model Selection (cost-first escalation) - ADR-0003: Confidence Gate Thresholds & Learning Cycle Intervals (6h/12h/24h cycles) - ADR-0004: External Provider Fallback Chain Ordering (Cerebras → Groq → Mistral) - Enhanced client SDK: Offline Ollama fallback, health checks, exponential backoff retry - Integration tests: claude-code-integration.test.ts (14 test cases) - PHASE_2F_DEPLOYMENT.md: Pre-deployment checklist, automated deploy, rollback plan - Post-deployment verification procedures for health, client fallback, metrics	2026-04-19 21:39:44 +02:00
Rene Fichtmueller	b4593b6582	feat: integrate real @shieldx/core library into gateway pipeline. Replace recursive HTTP-based ShieldX scan with direct library integration. - 547+ rules, 50+ languages, sub-millisecond scans - Enables: rules, entropy, indirect injection, behavioral, unicode, tokenizer, compressed payload detection - Disables Ollama-dependent scanners for zero external dependency - Response now includes threat_level, kill_chain_phase, shieldx_latency_ms	2026-04-07 09:03:02 +02:00
Rene Fichtmueller	e0b9fa1f53	feat: add CtxHealth self-healing daemon as new workspace package. New package @llm-gateway/ctx-health (packages/ctx-health/), a TypeScript infrastructure monitoring and auto-healing daemon. Monitors 8 subsystems every 60s (PM2, PostgreSQL, Ollama, Cloudflare tunnel, disk, memory, network, WireGuard), gets AI-powered root cause analysis via the gateway (ctxhealer caller / ctx_health_diagnose task_type), executes healing actions with cooldown (5min) and escalation guards (3+ failures → human escalation), persists all incidents to ctx_health_incidents and ctx_health_status tables. Dry-run mode via CTX_HEALTH_DRY_RUN=true. Runs as ctx-health PM2 process on Erik server.	2026-04-03 00:16:08 +02:00
Rene Fichtmueller	3a00ff4d33	feat: initial llm-gateway implementation - Complete Fastify gateway with 8-stage pipeline - Circuit breaker (opossum) per model tier - Rate limiting per caller - Ban list validation (EN/DE/auto-detected) - TIP validator (SFF-8024, part numbers, wavelengths) - Prometheus metrics - pg-boss async queue - PostgreSQL audit log + review queue - 9 prompt templates (TIP, LinkedIn, ShieldX) - Learning engine scaffolding - Auto-learning: ban-list, few-shot, routing, prompt optimizer	2026-04-02 22:48:55 +02:00

4 Commits
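
The CtxHealth commit (e0b9fa1f53) mentions healing actions guarded by a 5-minute cooldown and an escalation rule (3+ failures → human escalation). The following is a minimal sketch of how such a guard could look, not the package's actual code. Only the two thresholds come from the commit message. The names `HealingGuard`, `decide`, `record`, and `Decision` are hypothetical, and counting consecutive failures per subsystem is an assumption.

```typescript
// Illustrative sketch only. All identifiers are hypothetical; the commit
// message states just the 5-minute cooldown and the 3+ failure escalation.
type Decision = "heal" | "cooldown" | "escalate";

const COOLDOWN_MS = 5 * 60 * 1000; // "cooldown (5min)" from the commit message
const ESCALATE_AFTER_FAILURES = 3; // "3+ failures" from the commit message

interface SubsystemState {
  lastHealAt: number | null; // timestamp (ms) of the last healing attempt
  consecutiveFailures: number; // assumption: failures are counted consecutively
}

class HealingGuard {
  private readonly state = new Map<string, SubsystemState>();

  // Decide what to do for a subsystem that is currently unhealthy.
  decide(subsystem: string, now: number): Decision {
    const s = this.get(subsystem);
    // Escalation takes priority over the cooldown check.
    if (s.consecutiveFailures >= ESCALATE_AFTER_FAILURES) return "escalate";
    if (s.lastHealAt !== null && now - s.lastHealAt < COOLDOWN_MS) return "cooldown";
    return "heal";
  }

  // Record the outcome of a healing attempt made at time `now`.
  record(subsystem: string, now: number, healed: boolean): void {
    const s = this.get(subsystem);
    s.lastHealAt = now;
    s.consecutiveFailures = healed ? 0 : s.consecutiveFailures + 1;
  }

  // Fetch the state for a subsystem, creating it on first use.
  private get(subsystem: string): SubsystemState {
    let s = this.state.get(subsystem);
    if (!s) {
      s = { lastHealAt: null, consecutiveFailures: 0 };
      this.state.set(subsystem, s);
    }
    return s;
  }
}
```

A monitoring loop would call `decide` for each unhealthy subsystem on every check and call `record` after each attempted healing action. Passing `now` explicitly, rather than reading the clock inside the class, keeps the guard deterministic under test.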