transceiver-db/blog-training-data
Rene Fichtmueller 772ce2074d feat: add blog training articles 056-100 for fo-blog-v3 fine-tuning
45 expert articles covering: Cisco/Juniper/Arista optic compatibility mechanics,
100G/400G/800G optics selection, DWDM/ROADM/WSS architecture, fiber standards,
coherent pluggables, AI cluster optics, carrier timing, EEPROM programming,
market pricing 2026, hyperscale procurement, transceiver failure analysis, and more.
2026-04-07 08:59:16 +02:00
..

BlogLLM Training Data — Flexoptix Reference Articles

Gold-standard blog posts generated by Claude Sonnet (claude-sonnet-4-20250514) following the strict FO Blog Pipeline rules. These serve as reference examples for fine-tuning and training the BlogLLM.

Articles

File Title Type Score
blog-001-400g-dr4-price-war.md 400G DR4 Prices Are Moving... market_alert 9/10
blog-002-vendor-lock-in-optics.md The Hidden Tax in Your Transceiver Budget comparison 9/10
blog-003-silicon-photonics.md Silicon Photonics Is Shipping... technology_deep_dive 9/10
blog-004-400g-migration-fiber-plant.md Your 100G Fiber Plant Is Not Ready for 400G tutorial 9/10
blog-005-coherent-400zr-reality.md 400ZR Is Not What the Vendor Presentations Said technology_deep_dive 9/10
blog-006-dom-diagnostics.md Reading DOM Data Correctly tutorial 9/10
blog-007-800g-readiness.md 800G Is Shipping. Your Infrastructure Probably Isn't Ready. hype_cycle 9/10
blog-008-oem-vs-compatible-real-numbers.md OEM vs Compatible Transceivers: The Numbers Nobody Publishes buying_guide 9/10
blog-009-100g-to-400g-migration-what-breaks.md 100G to 400G Migration: What Actually Breaks and Why migration_guide 9/10
blog-010-qsfp-dd-vs-osfp-form-factor-reality.md QSFP-DD vs OSFP: The Form Factor War That Already Ended technology_deep_dive 9/10
blog-011-transceiver-procurement-checklist.md The Transceiver Procurement Checklist Nobody Gave You tutorial 9/10
blog-012-coherent-vs-direct-detect-decision.md Coherent vs. Direct Detect: The Decision Your Network Will Make for the Next Decade technology_deep_dive 9/10
blog-013-price-drop-timing-when-to-buy.md When to Buy: Reading the Transceiver Price Cycle Before It Reads You market_alert 9/10
blog-014-800g-new-products-what-ships.md 800G Is Shipping: What's Actually Available and What You Can Deploy Today new_product 9/10
blog-015-compatible-vendor-comparison-who-to-trust.md Compatible Transceiver Vendors in 2026: Who Does the Testing and Who Just Says They Do competitor_analysis 9/10

Quality Rules Met (per article)

All articles were generated under strict constraints:

  • No markdown headers (##, ###) anywhere in body
  • No bullet lists as structural elements
  • No LaTeX formulas
  • No banned AI phrases ("leverage", "optimize", "game-changer", etc.)
  • No spec dumps or comparison tables
  • No OEM pricing presented as compatible pricing
  • No sales language ("BUY / AVOID", verdict blocks)
  • DR4 connector: MPO-12 (never LC)
  • DR4 wavelength: 1310nm (never 1550nm)
  • 400ZR and DR4 treated as distinct technologies
  • No per-port power figures >25W
  • No made-up part numbers
  • Only CMOS/physics-grounded values
  • One core thesis per article
  • Flexoptix FINAL OUTCOME TEST: reader finishes ready to validate properly, not defaulting to OEM

Usage for BlogLLM Training

  1. Import these as positive examples into the fine-tuning dataset
  2. Each article is ~800-1200 words (production blog length)
  3. Type field maps to generation template types in fo-blog-pipeline.ts
  4. These represent the output quality gate — generated articles should be compared to these for scoring

Adding More Training Data

Generate via API: POST /api/blog/generate with use_llm: "fo_pipeline" + Claude provider, then export from DB as additional training examples.