llm-gateway/CHANGELOG_PENDING.md
Rene Fichtmueller 2ca77d0aee feat: Phase 2F — Multi-Agent Integration (ADRs + Client Fallback + Tests)
- ADR-0001: Multi-Agent Coworking Architecture with LLM Gateway Orchestrator
- ADR-0002: Tier Assignment Strategy for Model Selection (cost-first escalation)
- ADR-0003: Confidence Gate Thresholds & Learning Cycle Intervals (6h/12h/24h cycles)
- ADR-0004: External Provider Fallback Chain Ordering (Cerebras → Groq → Mistral)
- Enhanced client SDK: Offline Ollama fallback, health checks, exponential backoff retry
- Integration tests: claude-code-integration.test.ts (14 test cases)
- PHASE_2F_DEPLOYMENT.md: Pre-deployment checklist, automated deploy, rollback plan
- Post-deployment verification procedures for health, client fallback, metrics
2026-04-19 21:39:44 +02:00


# LLM Gateway — Pending Changelog Entries
## 2026-04-16
{"d":"2026-04-16","t":"FEAT","m":"MAGATAMA まがたま security intelligence model integrated: magatama:32b in models.yaml (large tier, 131k ctx), 6 routing rules (threat_analysis, ciso_report, compliance_gap, incident_response, bgp_security, vuln_triage), 6 prompt templates with full TEPPEKI doctrine (MITRE ATT&CK, NIS2, CIA Triad, CVSS v3.1, Kill Chain)"}
{"d":"2026-04-16","t":"AI","m":"MAGATAMA LoRA adapter (r=8, alpha=16) fine-tuned on Qwen2.5-32B-Instruct with 22831 security samples, converted to GGUF format for llama-server runtime deployment with --lora flag on Mac Studio port 11435"}
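
A minimal sketch of how the six routing rules named in the 2026-04-16 entry could map task types to the model. The task names and `magatama:32b` come from the entry above; `select_model`, `DEFAULT_MODEL`, and the flat set-based rule shape are assumptions for illustration, not the gateway's actual routing code.

```python
# Task types listed in the changelog entry for the MAGATAMA routing rules.
MAGATAMA_TASKS = frozenset({
    "threat_analysis",
    "ciso_report",
    "compliance_gap",
    "incident_response",
    "bgp_security",
    "vuln_triage",
})

MAGATAMA_MODEL = "magatama:32b"
DEFAULT_MODEL = "default-model"  # hypothetical placeholder, not a real model name


def select_model(task_type: str) -> str:
    """Return the security model for security task types, else the default."""
    if task_type in MAGATAMA_TASKS:
        return MAGATAMA_MODEL
    return DEFAULT_MODEL


routed = select_model("bgp_security")
fallback = select_model("blog_draft")  # hypothetical non-security task type
```

Under these assumptions, `routed` is `magatama:32b` and any task type outside the six listed ones falls through to the placeholder default.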
## 2026-04-02
{"d":"2026-04-02","t":"FEAT","m":"Fine-tuner SSH tunnel launch script (scripts/start.sh): opens SSH tunnel to Erik:5432 before running fine-tuner, bypassing IONOS firewall"}
{"d":"2026-04-02","t":"FIX","m":"Fine-tuner env vars: FT_DB_URL/FT_GATEWAY_URL/FT_OLLAMA_URL (not DATABASE_URL) — fine-tuner status command now works"}
{"d":"2026-04-02","t":"FIX","m":"DB schema migration: added status, confidence_score, used_in_training, system_prompt, input_text, output_text, human_edited, edited_output, used_in_dpo_training to learning_corpus; added run_type, task_type, started_at, deployed_model_name to fine_tuning_runs"}
{"d":"2026-04-02","t":"FEAT","m":"CtxHealth: new workspace package packages/ctx-health — TypeScript self-healing daemon monitoring PM2, PostgreSQL, Ollama, Cloudflare tunnel, disk, memory, network, WireGuard every 60s; LLM-powered diagnosis via ctxhealer caller; auto-healing with cooldown + escalation guards; incidents persisted to ctx_health_incidents + ctx_health_status tables; dry-run mode; PM2 process name: ctx-health"}
{"d":"2026-04-02","t":"FIX","m":"Template variable resolution: input is now mapped as fallback for all 20+ content variable names (ocr_text, alert_data, bgp_data, anomaly_data, etc.) — all 7 project templates work with simple string input"}
{"d":"2026-04-02","t":"FIX","m":"24 templates updated: {{input}} added as primary content variable so simple callers work without knowing domain-specific variable names (nog_cfp_evaluate, pc_as_narrative, sb_root_cause, shieldx_false_positive, etc.)"}
{"d":"2026-04-02","t":"FIX","m":"Prompt resolver: input also aliased as source_data fallback; context fields spread into template vars"}
{"d":"2026-04-02","t":"DATA","m":"DB schema migration run on Erik: learning_corpus and fine_tuning_runs now compatible with Python fine-tuner"}
{"d":"2026-04-02","t":"FEAT","m":"Full E2E test: all 7 projects verified (TIP, EO, PC, SB, NOGnet, ShieldX, CtxEvent) — 100% approved status, avg confidence 8.25/10, 70 calls in audit log"}
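
The template fixes in the 2026-04-02 entries (plain `input` as a fallback for domain-specific variable names, `source_data` aliasing, context fields spread into the template variables) can be sketched roughly as follows. This is an illustrative sketch, not the gateway's resolver: `CONTENT_VARS` is only the subset of names quoted in the entries, `resolve_vars` and `render` are hypothetical helpers, and letting context fields override the fallback is an assumption.

```python
import re

# Subset of content variable names quoted in the changelog entries;
# the real list is described as "20+" names and is not reproduced here.
CONTENT_VARS = ("ocr_text", "alert_data", "bgp_data", "anomaly_data", "source_data")

# Matches {{name}} placeholders, allowing optional inner whitespace.
PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def resolve_vars(input_text, context=None):
    """Build template variables: input as fallback, then context on top."""
    values = {name: input_text for name in CONTENT_VARS}
    values["input"] = input_text
    # Assumption: explicit context fields take precedence over the fallback.
    values.update(context or {})
    return values


def render(template, input_text, context=None):
    """Substitute known placeholders; leave unknown ones untouched."""
    values = resolve_vars(input_text, context)
    return PLACEHOLDER.sub(
        lambda m: str(values.get(m.group(1), m.group(0))), template
    )


simple = render("Analyse this route data: {{bgp_data}}", "AS64500 prefix leak")
mixed = render("{{input}} (severity: {{severity}})", "disk full", {"severity": "high"})
unknown = render("{{input}} {{not_a_var}}", "x")
```

With this shape a caller that only passes a string still fills a template written against `bgp_data`, while a caller that supplies the domain-specific field keeps full control over it.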
## 2026-04-06
{"d":"2026-04-06","t":"AI","m":"BlogLLM v1 (fo-blog-v1-qwen7b Q5_K_M): GGUF conversion + Ollama registration complete. Test result: pipeline runs end-to-end, technical content OK, but format compliance fails (bullet lists despite NO-BULLET instruction, training prompt leakage). Root cause: max_seq_length=1024 during training cut articles at ~512 words. Fix applied: max_seq_length=2048 for next run. Re-train needed before production."}
{"d":"2026-04-06","t":"FIX","m":"TIP: Reset 298 stale details_verified flags (reach_label empty). Runtime check in dashboard: dVer gated on non-empty reach_label. Wrong image cleared for O.Czz8HG.z.R."}
{"d":"2026-04-06","t":"FIX","m":"TIP comparable prices: treat reach_meters=0 same as NULL in API query — 800G OSFP FS.com prices now visible on Flexoptix product pages."}
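
The comparable-prices fix (treating `reach_meters=0` the same as `NULL`) can be illustrated with a small in-memory SQLite sketch. Only the `reach_meters` column name comes from the entry above; the table name, the other columns, the vendor names, and the prices are made up for illustration, and the real API query may be structured differently.

```python
import sqlite3


def comparable_prices(conn, form_factor, reach_meters):
    """Return rows whose reach matches, or whose reach is unknown (NULL or 0)."""
    return conn.execute(
        """
        SELECT vendor, price
        FROM competitor_prices  -- hypothetical table name
        WHERE form_factor = ?
          AND (NULLIF(reach_meters, 0) IS NULL OR reach_meters = ?)
        ORDER BY vendor
        """,
        (form_factor, reach_meters),
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE competitor_prices "
    "(vendor TEXT, form_factor TEXT, reach_meters INTEGER, price REAL)"
)
# Made-up sample rows: one reach stored as 0, one as NULL, two with real values.
conn.executemany(
    "INSERT INTO competitor_prices VALUES (?, ?, ?, ?)",
    [
        ("vendor_a", "OSFP", 0, 1000.0),
        ("vendor_b", "OSFP", None, 1100.0),
        ("vendor_c", "OSFP", 500, 900.0),
        ("vendor_d", "OSFP", 2000, 1500.0),
    ],
)

rows = comparable_prices(conn, "OSFP", 500)
vendors = [vendor for vendor, _price in rows]
```

`NULLIF(reach_meters, 0)` turns a stored `0` into `NULL`, so a single `IS NULL` check covers both ways of recording an unknown reach; a row with a different known reach (`vendor_d` here) is still excluded.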