Rene Fichtmueller
|
db6b97186a
|
feat: OPN+spec equivalence matchers, 400G pricing, TIP_LLM training data
- Add OPN-based equivalence matcher robot (7,245 manufacturer-confirmed matches, confidence=1.0)
- Add spec-based equivalence matcher robot (683 matches, confidence=0.85)
- Spec matcher keys on form_factor + speed_gbps + reach_tier + wavelength (±10 nm)
- Safety cap: skip FX products matching >30 competitors (too generic)
- Daily schedule: 04:30 UTC via pg-boss
- SQL migrations 116 (OPN) + 117 (spec) with tip_extract_wavelength_nm() + tip_reach_tier() helpers
- Fix tenGtek.ts: add 3 missing 400G categories (QSFP-DD, QSFP112), closing the pricing gap
- Generate tip-llm-pricing-v1.jsonl: 80 DB-grounded QA pairs (pricing, equivalences, 400G)
- Rebuild TIP_LLM training pool: 11,999 pairs (+127 vs prev), deployed to Erik
- FX product equivalence coverage: 88.1% (959/1089)
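
The spec-based matching rule described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the `Product` shape, the `reachTier` thresholds, and the handling of a missing wavelength are all assumptions (the real `tip_reach_tier()` and `tip_extract_wavelength_nm()` helpers live in SQL migration 117 and their boundaries are not stated here). Only the match keys, the ±10 nm tolerance, the >30 cap, and the 0.85 confidence come from the commit message.

```typescript
// Hypothetical product shape; field names mirror the commit message, not the real schema.
interface Product {
  id: string;
  formFactor: string; // e.g. "QSFP-DD"
  speedGbps: number;
  reachM: number; // nominal reach in metres (assumed unit)
  wavelengthNm: number | null; // null when no wavelength could be extracted
}

interface Match {
  fxId: string;
  competitorId: string;
  confidence: number;
}

const WAVELENGTH_TOLERANCE_NM = 10; // from the commit message
const MAX_COMPETITORS = 30; // from the commit message: skip if more than this many hits
const SPEC_CONFIDENCE = 0.85; // from the commit message

// Assumed tier boundaries, for illustration only.
function reachTier(reachM: number): string {
  if (reachM <= 500) return "short";
  if (reachM <= 10000) return "medium";
  return "long";
}

// True when two products agree on all four spec keys.
function specMatches(fx: Product, c: Product): boolean {
  // Assumption: a product without a known wavelength never matches.
  if (fx.wavelengthNm === null || c.wavelengthNm === null) return false;
  return (
    fx.formFactor === c.formFactor &&
    fx.speedGbps === c.speedGbps &&
    reachTier(fx.reachM) === reachTier(c.reachM) &&
    Math.abs(fx.wavelengthNm - c.wavelengthNm) <= WAVELENGTH_TOLERANCE_NM
  );
}

// Pair each FX product with its spec-equivalent competitors,
// dropping FX products whose hit list exceeds the safety cap.
function matchBySpec(fxProducts: Product[], competitors: Product[]): Match[] {
  const out: Match[] = [];
  for (const fx of fxProducts) {
    const hits = competitors.filter((c) => specMatches(fx, c));
    if (hits.length > MAX_COMPETITORS) continue; // too generic to be useful
    for (const c of hits) {
      out.push({ fxId: fx.id, competitorId: c.id, confidence: SPEC_CONFIDENCE });
    }
  }
  return out;
}
```

The cap is applied per FX product after filtering, so a product with exactly 30 hits keeps all of them while one with 31 contributes none.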
|
2026-05-13 21:33:19 +02:00 |
|
Rene Fichtmueller
|
8e367b3c33
|
feat: TIP_LLM 5-capability training data + updated system prompt
- Add scripts/seed-tip-llm-capabilities.ts: generator for 34 SFT pairs
covering all 5 TIP_LLM capabilities (transceiver research, switch
research, Blog_LLM data evaluation, crawler/scraper design, Hype Cycle)
- Add training-data/tip-llm-capabilities-v1.jsonl: generated output (34 pairs)
- Update tip-learning-pool-build.ts: expanded 5-capability system prompt
replaces single-line prompt; register capabilities file in files.tip_llm
- Regenerate tip_llm runpod outputs: 12141 raw pairs → 11872 training pairs
(up from 10654 before capabilities addition)
- Published tip_llm (11872 pairs) + blog_llm (11408 pairs) to HuggingFace
|
2026-04-26 00:01:21 +02:00 |
|