TIP Lane Detangling And Disk-Safe Refresh
Date: 2026-05-06 UTC
Summary
TIP_LLM was still contaminated by blog/writer behavior even though lane-specific counts were already separated in MAGATAMA. The problem was not limited to UI-level status: the lane corpus feeding the RunPod export was itself contaminated.
The lane was rebuilt and revalidated locally, then synced to Erik and refreshed there. The result is that TIP_LLM now uses a much smaller but correctly aligned research/network corpus instead of silently inheriting FO_Blog-like behavior.
Root Cause
- The canonical training-data/gitea-learning-pool/tip_llm/*.jsonl pool still contained many blog-shaped rows from shared transceiver corpora.
- The old TIP export sampled thousands of rows whose prompts/messages still looked like:
  - You are an expert technical writer...
  - publication-ready/blog instructions
- A direct local check on the pre-fix TIP export showed:
  - 6250 train rows
  - 6087 matched blog/writer patterns
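A check like the one above can be sketched as a raw-text scan over the export. This is a minimal sketch, not the script that was actually run: the function name countBlogShapedRows and the pattern list are hypothetical, with the phrases taken from the prompts quoted in this note.

```typescript
// Hypothetical pattern list; the phrases come from the blog/writer prompts named in this note.
const BLOG_WRITER_PATTERNS: RegExp[] = [
  /expert technical writer/i,
  /blog writer/i,
  /publication-ready/i,
  /technical writer specializing/i,
];

interface MatchCount {
  total: number;   // non-empty JSONL rows
  matched: number; // rows containing at least one blog/writer phrase
}

// Matching runs on the raw line, so it does not depend on the row schema.
function countBlogShapedRows(jsonl: string): MatchCount {
  const rows = jsonl.split("\n").filter((line) => line.trim().length > 0);
  const matched = rows.filter((row) =>
    BLOG_WRITER_PATTERNS.some((pattern) => pattern.test(row)),
  ).length;
  return { total: rows.length, matched };
}
```

Scanning the raw line is deliberately crude: it flags a row if the phrase appears anywhere, which is enough for a contamination count but too blunt for a per-message gate.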
Changes Applied
scripts/runpod_dataset_builder.ts
- Added a stricter tipDatasetAllowed(...) gate.
- Tightened laneRecordIsCompatible(...) for tip_llm.
- Tightened lanePoolMessagesAlign(...) for tip_llm:
  - reject:
    - blog writer
    - publication-ready
    - technical writer specializing
    - article-outline/founder/blog prompts
    - markdown-article assistant outputs
- TIP registry fallback now only considers lane-compatible datasets.
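The reject rules listed above could look roughly like the following per-message gate. This is an illustrative sketch only: the real lanePoolMessagesAlign(...) signature is not shown in this note, so the ChatMessage shape, the function name tipMessagesAlignSketch, and the markdown-article heuristic are all assumptions.

```typescript
// Hypothetical message shape; the real row schema is not shown in this note.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Phrases taken from the reject list; the exact regexes are assumptions.
const REJECTED_PROMPT_PHRASES: RegExp[] = [
  /blog writer/i,
  /publication-ready/i,
  /technical writer specializing/i,
  /article[- ]outline/i,
];

// Heuristic (assumption): a reply that opens with a "# " title and contains
// at least two "## " sections is treated as a markdown article.
function looksLikeMarkdownArticle(text: string): boolean {
  const startsWithTitle = /^#\s+\S/.test(text.trimStart());
  const sectionCount = (text.match(/^##\s+\S/gm) ?? []).length;
  return startsWithTitle && sectionCount >= 2;
}

// Returns false as soon as any message looks blog-shaped.
function tipMessagesAlignSketch(messages: ChatMessage[]): boolean {
  for (const message of messages) {
    if (message.role === "assistant") {
      if (looksLikeMarkdownArticle(message.content)) return false;
    } else if (REJECTED_PROMPT_PHRASES.some((p) => p.test(message.content))) {
      return false;
    }
  }
  return true;
}
```

Prompts and assistant outputs are checked separately because the two failure modes differ: prompts leak writer instructions, while outputs leak article structure.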
scripts/sync_gitea_training_pool.ts
- Applied the same stricter TIP lane-alignment logic.
- Stopped rewriting redundant merged.jsonl copies for:
  - fo_blogllm
  - tip_llm
- This was necessary because the duplicated merged artifacts caused local disk exhaustion during refresh.
Disk Incident
During the first rebuild after the lane hardening, refresh failed with:
ENOSPC: no space left on device
The immediate cause was writing:
training-data/gitea-learning-pool/tip_llm/merged.jsonl
Fix:
- truncated redundant merged artifacts for fo_blogllm and tip_llm
- changed sync logic so those duplicates are no longer recreated
Result:
- free disk space recovered from roughly 377Mi to 17Gi
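The sync change described above amounts to planning outputs per lane and leaving out the merged copy for the two affected lanes. A minimal sketch follows, assuming a hypothetical plannedLaneOutputs helper; the lane names and the merged.jsonl path come from this note, while the idea that other lanes keep their merged copy is an assumption.

```typescript
// Lanes named in this note whose merged.jsonl is no longer recreated.
const LANES_WITHOUT_MERGED_COPY: ReadonlySet<string> = new Set([
  "fo_blogllm",
  "tip_llm",
]);

// Returns the paths a sync pass would write for one lane.
// Assumption: lanes outside the set above still get a merged.jsonl.
function plannedLaneOutputs(lane: string, sourceFiles: string[]): string[] {
  const laneDir = `training-data/gitea-learning-pool/${lane}`;
  const outputs = sourceFiles.map((name) => `${laneDir}/${name}`);
  if (!LANES_WITHOUT_MERGED_COPY.has(lane)) {
    outputs.push(`${laneDir}/merged.jsonl`);
  }
  return outputs;
}
```

Deciding the output list before any write happens keeps the disk footprint predictable, which is the property the ENOSPC failure showed was missing.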
Verified Local Result
After rebuild:
TIP_LLM
- train = 233
- eval = 26
- total = 259
- blog/writer matches = 0
First rows now use the intended TIP instruction style:
You are TIP_LLM, a research and market-intelligence analyst for transceivers, switches, and vendor ecosystems...
This confirms the lane is no longer silently shaped like FO_Blog.
Synced To Erik
Synced:
- updated scripts:
  - runpod_dataset_builder.ts
  - sync_gitea_training_pool.ts
  - submit_runpod_training.ts
- rebuilt lane exports:
  - training-data/runpod/magatamallm/*
  - training-data/runpod/fo_blogllm/*
  - training-data/runpod/tip_llm/*
Then reran on Erik:
pnpm training:refresh-all
Live Erik / Public API Result
magatamallm
- datasetSource = url
- collectedExamples = 15679
- evalExamples = 1743
- totalExamples = 17422
- newSinceLastTraining = 15679
fo_blogllm
- datasetSource = url
- collectedExamples = 17322
- evalExamples = 1926
- totalExamples = 19254
- neverTrained = true
tip_llm
- datasetSource = url
- collectedExamples = 231
- evalExamples = 26
- totalExamples = 257
- neverTrained = true
Remaining Work
The remaining hard blocker is no longer lane contamination.
It is now:
- RunPod artifact validation/adoption
Desired next step:
- only accept RunPod COMPLETED as success if a real artifact exists
- verify artifact importability
- update/adopt local Ollama tag automatically
- switch MAGATAMA only after successful adoption
- run pre/post smoke prompts
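The first of these steps can be sketched as a pure decision function that refuses to treat COMPLETED as success without a non-empty artifact. This is a sketch under stated assumptions: the JobSummary fields and the decideAdoption name are hypothetical and do not reflect RunPod's actual response schema; only the COMPLETED status string comes from this note.

```typescript
// Hypothetical job summary; field names are illustrative, not RunPod's API schema.
interface JobSummary {
  status: string;         // e.g. "COMPLETED" or "FAILED"
  artifactPath?: string;  // where the trained artifact was stored, if anywhere
  artifactBytes?: number; // size of that artifact
}

type AdoptionDecision =
  | { adopt: true; artifactPath: string }
  | { adopt: false; reason: string };

// COMPLETED alone is not enough: a real, non-empty artifact must exist.
function decideAdoption(job: JobSummary): AdoptionDecision {
  if (job.status !== "COMPLETED") {
    return { adopt: false, reason: `status is ${job.status}` };
  }
  if (!job.artifactPath) {
    return { adopt: false, reason: "COMPLETED without an artifact path" };
  }
  if (!job.artifactBytes || job.artifactBytes <= 0) {
    return { adopt: false, reason: "artifact is missing or empty" };
  }
  return { adopt: true, artifactPath: job.artifactPath };
}
```

Importability checks, the Ollama tag update, and the MAGATAMA switch would run only after this function returns adopt = true, so a failed validation leaves the currently served model untouched.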