2026-04-06 02:55:34 +02:00

2.8 KiB

BlogLLM Training Data — Flexoptix Reference Articles

Gold-standard blog posts generated by Claude Sonnet (claude-sonnet-4-20250514) following the strict FO Blog Pipeline rules. These serve as reference examples for fine-tuning and training the BlogLLM.

Articles

File Title Type Score
blog-001-400g-dr4-price-war.md 400G DR4 Prices Are Moving... market_alert 9/10
blog-002-vendor-lock-in-optics.md The Hidden Tax in Your Transceiver Budget comparison 9/10
blog-003-silicon-photonics.md Silicon Photonics Is Shipping... technology_deep_dive 9/10
blog-004-400g-migration-fiber-plant.md Your 100G Fiber Plant Is Not Ready for 400G tutorial 9/10
blog-005-coherent-400zr-reality.md 400ZR Is Not What the Vendor Presentations Said technology_deep_dive 9/10
blog-006-dom-diagnostics.md Reading DOM Data Correctly tutorial 9/10
blog-007-800g-readiness.md 800G Is Shipping. Your Infrastructure Probably Isn't Ready. hype_cycle 9/10
blog-008-oem-vs-compatible-real-numbers.md OEM vs Compatible Transceivers: The Numbers Nobody Publishes buying_guide 9/10
blog-009-100g-to-400g-migration-what-breaks.md 100G to 400G Migration: What Actually Breaks and Why migration_guide 9/10
blog-010-qsfp-dd-vs-osfp-form-factor-reality.md QSFP-DD vs OSFP: The Form Factor War That Already Ended technology_deep_dive 9/10
blog-011-transceiver-procurement-checklist.md The Transceiver Procurement Checklist Nobody Gave You tutorial 9/10

Quality Rules Met (per article)

All articles were generated under strict constraints:

  • No markdown headers (##, ###) anywhere in body
  • No bullet lists as structural elements
  • No LaTeX formulas
  • No banned AI phrases ("leverage", "optimize", "game-changer", etc.)
  • No spec dumps or comparison tables
  • No OEM pricing presented as compatible pricing
  • No sales language ("BUY / AVOID", verdict blocks)
  • DR4 connector: MPO-12 (never LC)
  • DR4 wavelength: 1310nm (never 1550nm)
  • 400ZR and DR4 treated as distinct technologies
  • No per-port power figures >25W
  • No made-up part numbers
  • Only CMOS/physics-grounded values
  • One core thesis per article
  • Flexoptix FINAL OUTCOME TEST: reader finishes ready to validate properly, not defaulting to OEM

Usage for BlogLLM Training

  1. Import these as positive examples into the fine-tuning dataset
  2. Each article is ~800-1200 words (production blog length)
  3. Type field maps to generation template types in fo-blog-pipeline.ts
  4. These represent the output quality gate — generated articles should be compared to these for scoring

Adding More Training Data

Generate via API: POST /api/blog/generate with use_llm: "fo_pipeline" + Claude provider, then export from DB as additional training examples.