1.5 KiB
1.5 KiB
2026-05-06 — MAGATAMA training count source fix
Summary
MAGATAMA training UI was still showing 1097 because the dashboard counted the legacy deduplicated fix corpus instead of the current lane-specific RunPod export.
Root cause
- Dashboard training summary read
getTrainingCorpusStats()fromgitea-learning-pool/magatamallm/fixes.jsonl. - Live Erik state still had a huge raw
fixes.jsonland an old dedupe-derived effective count path. - The actual current training source for RunPod is the lane export under:
training-data/runpod/magatamallm/magatamallm-sft-train.jsonltraining-data/runpod/magatamallm/magatamallm-sft-eval.jsonltraining-data/runpod/magatamallm/manifest.json
Fix
packages/dashboard/src/server.tsnow prefers the lane manifest formagatamallmtraining counts.- Live summary now uses:
train = 1367eval = 152totalAfterDedupe = 1519duplicatesRemoved = 1368
- Synced the current local
training-data/runpod/magatamallm/directory to Erik. - Restarted
magatama-dashboard.
Verified live
Public API now returns:
training.collectedExamples = 1367training.effectiveExamples = 1367training.evalExamples = 152training.totalExamples = 1519training.newSinceLastTraining = 1367training.collectionsPath = /opt/magatama/training-data/runpod/magatamallm/manifest.json
Operator note
If the UI still shows 1097, it is a browser cache/stale page issue. Hard reload the MAGATAMA dashboard.