1.9 KiB
1.9 KiB
TIP Global Verification Continuation — 2026-05-09
Scope
- Continue TIP verification with deterministic scrapers/robots only.
- Keep Erik safe; no heavy Playwright/proxmox-heavy wave.
- Write learnings into the TIPLLM training pool.
Implemented
- Repaired GAO Tek scraper for the current Woodmart product-card layout.
- Excluded category URLs from active product verification/search counters.
- Added a catalog-details verifier for complete source-backed OEM/catalog specs.
- Fixed Flexoptix image backfill case sensitivity.
- Expanded
og:imagebackfill vendor coverage. - Hardened scheduler reconcile so category URLs are not promoted as details source.
Live Runs
- GAO Tek:
- 20 pages fetched.
- 480 real product cards extracted.
- 0 public prices found.
- 6 category/non-product artifacts reset.
- Priority pi-fetch wave:
- GAO Tek, Juniper OEM/MX/QFX, Cisco Nexus/Catalyst/ASR, Ascent, Eoptolink, Flexoptix, Flexoptix supported vendors, Arista OEM.
- All jobs completed.
- Reconcile completed.
- Equivalence matcher completed.
- Catalog-details verifier:
- 4,340 details verified.
- Image backfill:
- 48 images from expanded vendor list.
- 12 additional Flexoptix images after case-insensitive vendor fix.
Final Observed State
- Public health: healthy.
- Load: ok.
- Memory: 13%.
- Active total: 17,714.
- Price verified: 11,582.
- Image verified: 12,194.
- Details verified: 16,684.
- Fully verified: 11,052.
Remaining Truth
- GAO Tek is quote-only/no public price in the crawled catalog; prices were not fabricated.
- Many OEM rows now have verified details but still need public images/prices/competitor evidence.
- Flexoptix still has 110 image-missing SKUs after GraphQL returned no image.
- Top remaining blockers are dominated by price/image/competitor availability.
Training Pool
- Appended one JSONL event to
/tmp/tip-training-data/robot-experiences/2026-05-09.jsonl. - JSONL validated successfully.