transceiver-db

Author	SHA1	Message	Date
Rene Fichtmueller	67310c8fe7	fix(blog): SPA-aware URL blog generation + dynamic generated_by - fetchUrlContent() now extracts OG/meta tags (og:title, og:description, name="description", og:site_name) as fallback content for JS-rendered SPAs - Returns spaDetected=true when body text < 300 chars after stripping scripts - from-url endpoint skips gatherBlogData() product injection when SPA detected, preventing fo-blog-v10 from defaulting to optical networking domain - additionalContext now includes SPA warning instructing LLM not to default to optical transceiver topics unless the page is actually about that - generated_by in pipeline UPDATE query now uses active model name instead of hardcoded 'fo-blog-engine-v7' (reads getLlmProvider().ollamaModel) - Dashboard shows SPA warning toast when spa_detected=true in response - Response now includes spa_detected field for client awareness	2026-05-14 12:29:17 +02:00
Rene Fichtmueller	e0f9656684	feat: Blog Engine — generate from URL (link → BlogLLM → article) New POST /api/blog/from-url endpoint: - Accepts url + topic in request body - Fetches page server-side (no CORS, 20s timeout, redirect-follow) - Strips script/style/nav/footer/svg; extracts readable text (~5000 chars) - Extracts page title from <title> or <h1> - Passes extracted content as structured additional_context to the existing 16-step FO blog pipeline (same as manual generation) - Returns immediately; LLM pipeline runs async - Validated: smoke test fetched flexoptix.net/en/blog/, 5040 chars, pipeline launched with llm_enhancing=true New "🔗 Blog aus URL generieren" panel in dashboard: - URL input (Enter key triggers generation) - Blog-Typ dropdown (same 8 types as manual panel) - Button shows loading state "⏳ Fetching…" during API call - Status line shows extracted char count after success - Reuses pollBlogLlm() for step-by-step progress polling - Inline status field for error display without toast spam	2026-05-14 00:55:35 +02:00
Rene Fichtmueller	048bf0dcf2	feat: add Codex task for Flexoptix reference matching overhaul CODEX-TASK-flexoptix-reference-matching.md — comprehensive plan to fix zero-match gap for ATGBICS/NADDOD/10Gtek/ShopFiber24 (8.260+ products with 0 Flexoptix equivalences). Root cause: 30-day price_observation window excludes vendors whose scrapers ran >30 days ago. Solution: catalog-reconcile robot (full bulk match, no time limit), form_factor normalization (SQL 108), 30→90 day window fix in nightly matcher, on-demand API endpoint. Expected: coverage from 22% → 45-60% after one reconcile run.	2026-05-13 16:51:53 +02:00
Rene Fichtmueller	10af2ca244	fix: generated_by tag — v6-length-fix → v7	2026-05-10 09:55:39 +02:00
Rene Fichtmueller	270bd12382	feat(dashboard): clickable LLM model selector — switch blog engine at runtime - client.ts: BLOG_LLM_PROVIDER/OLLAMA_LLM_MODEL as mutable state (setLlmProvider/ getLlmProvider). Reads blog-llm-settings.json on startup for persistence. All generate()/checkHealth()/chat() calls use dynamic provider() + llmModel() — no restart required for switches. - blog.ts: POST /api/blog/llm/switch endpoint — validates provider, calls setLlmProvider(), writes settings file, returns previous+active state. - index.html: all 4 model cards now clickable (cursor:pointer, hover fade). switchBlogLlm(provider, model) — disables cards during switch, shows green/red feedback toast, auto-refreshes status. SSH instruction removed.	2026-04-29 01:15:45 +02:00
Rene Fichtmueller	b5decc517f	fix: hard-cap blog generation to 800-1100 words LLM_OPTS.maxTokens 8192 → 1600, LLM_REFINE 6144 → 1800, Step 4 master draft 8192 → 1600. Added explicit word-count constraint to STEP4_MASTER_DRAFT prompt: HARD LIMIT 800-1100 words. Root cause: no token ceiling → fo-blog-v6 produced 4000-5000w articles. Generated-by label updated to fo-blog-engine-v6-length-fix.	2026-04-28 22:45:48 +02:00
Rene Fichtmueller	e9fcda2811	feat: wire finder.ts + switch-docs + Ollama LLM tools to MCP server MCP Server (packages/mcp-server/src/index.ts): - Register registerSwitchDocTools (switch-docs.ts) — switch documentation lookup - Register finderTools dynamically (finder.ts) — find_flexoptix_for_switch, get_competitor_alerts - Add analyze_market_with_llm tool: qwen2.5:14b via Ollama, enriched with live hype cycle + pricing + news - Add generate_blog_post tool: fo-blog-v5 (fine-tuned) with qwen2.5:14b fallback, enriched with live pricing data - OLLAMA_BASE_URL env var (default: https://ollama.fichtmueller.org) Also includes scraper improvements (ascentoptics, atgbics, gbics, skylane, ebay-enricher), API route updates (blog, blog-sll, health, hot-topics, transceivers, queries), and dashboard hot-topics refresh.	2026-04-18 00:21:58 +02:00
Rene Fichtmueller	fea0b0fb66	feat: blog engine v5 — Auto-Kill Layer, 16-step pipeline, longer content Upgrades FO Blog Pipeline from 14 to 16 steps: - NEW Step 8d: Auto-Kill Layer v1.0 (10 systematic categories A-J) - NEW Step 15: Auto-Kill Scoring (cleanliness, narrative, non-AI, relevance) - Updated banned phrases from Gold-standard editorial feedback - Soft Delete List for conditional phrases - Auto-Kill categories: spec blocks, formulas, section leakage, generic transitions, repeated concepts, SKU mentions, false authority, over-explained basics, whitepaper tone, fake precision Content length changes per user feedback: - Blog target: 1,200-2,000 words (was 700-1,000) — thorough and detailed - LinkedIn target: 2,000-2,800 chars (was 350-600) — use maximum length - Reduction pass: 25-30% cut (was 15-25%) — remove weak, keep depth	2026-04-04 11:02:45 +02:00
Rene Fichtmueller	ede4f5b966	feat: blog engine v3 — 8-stage pipeline with Auto-Kill Layer Complete rewrite of blog prompts and pipeline based on editorial Gold-standard feedback. Replaces 3-pass system with 8-stage pipeline: 1. Master generation (narrative voice, no spec dumps) 2. Narrative Control (kill visible structure, enforce flow) 3. Auto-Kill Layer (remove AI phrases, spec residue, repetition) 4. Reduction Engine (cut 40% — keep strongest ideas only) 5. Depth pass (add specifics where vague, no spec dumps) 6. Quality Control (hard delete list validation) 7. Procurement layer (optional, sales audience) 8. LinkedIn post generation (new) Key changes: - System prompt rewritten with Hard Delete List (29 banned phrases) - Soft Delete List for conditional phrases - Auto-Kill categories A-J (spec blocks, formulas, whitepaper tone, etc.) - Master prompts enforce continuous narrative, no section headings - Word count targets reduced (800-1200 instead of 1500+) - Scoring pass added (cleanliness, narrative, non-AI feel, relevance) - LinkedIn companion post auto-generated - Context data injection reduced (fewer items, no dump instructions)	2026-04-04 10:52:31 +02:00
Rene Fichtmueller	814325b349	feat: dashboard v2, blog expansion, market/cable MCP tools, switch asset scrapers, scraper utilities	2026-03-30 08:07:12 +02:00
Rene Fichtmueller	70447def02	feat: massive scraper expansion + hype cycle engine + lifecycle prediction New scrapers: - GBICS.com (BigCommerce, GBP prices, 10 categories, 78 products) - Juniper HCT (Next.js SSR parser, 475 transceivers with specs/EOL) - SFPcables.com (Magento store, 16 categories, 78 products) - Fluxlight (BigCommerce, 6 pages, 118 products) - Champion ONE (compatible vendor scraper) Scraper fixes: - 10Gtek: rewritten to parse HTML spec tables (152 products) - Flexoptix: fix price extraction from Magento Hyva HTML - Register all scrapers in CLI (--gbics, --juniper, --sfpcables, etc.) Hype Cycle Engine enhancements: - Data-driven enrichment from scraped vendor/price data - Revenue lifecycle prediction (peak year, decline, revenue index) - Regional adoption model (NA, China, APAC, Europe, RoW with lag coefficients) - New API endpoints: /enriched, /lifecycle, /regional/:tech DB growth: 89 → 1,168 transceivers, 0 → 416 prices, 6 vendors Qdrant: 1,162 products embedded with nomic-embed-text Research: Norton-Bass model, standards-to-market timelines, hype signals	2026-03-28 02:30:19 +13:00
Rene Fichtmueller	274b80a4f1	feat: Phase 7 — Blog generator + scraper scheduler activation Blog draft engine generates structured markdown from all Qdrant collections (products, news, FAQ, troubleshooting). Supports 4 topic types: hype_cycle, comparison, new_product, tutorial. - routes/blog.ts: POST /api/blog/generate, GET/PUT endpoints - ecosystem.config.js: Added tip-scraper PM2 process - Scraper scheduler (pg-boss) now running on Erik with 8 job queues - News scraper running every 6 hours on Erik	2026-03-28 00:32:08 +13:00

12 Commits