transceiver-db/sync/history/2026-05-09-tip-global-verification-continuation.md
2026-05-09 20:19:19 +02:00

1.9 KiB

TIP Global Verification Continuation — 2026-05-09

Scope

  • Continue TIP verification with deterministic scrapers/robots only.
  • Keep Erik safe; no heavy Playwright/proxmox-heavy wave.
  • Write learnings into the TIPLLM training pool.

Implemented

  • Repaired GAO Tek scraper for the current Woodmart product-card layout.
  • Excluded category URLs from active product verification/search counters.
  • Added a catalog-details verifier for complete source-backed OEM/catalog specs.
  • Fixed Flexoptix image backfill case sensitivity.
  • Expanded og:image backfill vendor coverage.
  • Hardened scheduler reconcile so category URLs are not promoted as details source.

Live Runs

  • GAO Tek:
    • 20 pages fetched.
    • 480 real product cards extracted.
    • 0 public prices found.
    • 6 category/non-product artifacts reset.
  • Priority pi-fetch wave:
    • GAO Tek, Juniper OEM/MX/QFX, Cisco Nexus/Catalyst/ASR, Ascent, Eoptolink, Flexoptix, Flexoptix supported vendors, Arista OEM.
    • All jobs completed.
  • Reconcile completed.
  • Equivalence matcher completed.
  • Catalog-details verifier:
    • 4,340 details verified.
  • Image backfill:
    • 48 images from expanded vendor list.
    • 12 additional Flexoptix images after case-insensitive vendor fix.

Final Observed State

  • Public health: healthy.
  • Load: ok.
  • Memory: 13%.
  • Active total: 17,714.
  • Price verified: 11,582.
  • Image verified: 12,194.
  • Details verified: 16,684.
  • Fully verified: 11,052.

Remaining Truth

  • GAO Tek is quote-only/no public price in the crawled catalog; prices were not fabricated.
  • Many OEM rows now have verified details but still need public images/prices/competitor evidence.
  • Flexoptix still has 110 image-missing SKUs after GraphQL returned no image.
  • Top remaining blockers are dominated by price/image/competitor availability.

Training Pool

  • Appended one JSONL event to /tmp/tip-training-data/robot-experiences/2026-05-09.jsonl.
  • JSONL validated successfully.