id: tip_transceiver_enrich version: "1.0.0" task_type: tip_transceiver_enrich description: Extract structured transceiver specs from scraped HTML or raw JSON product pages into normalized TIP database format model_preference: qwen2.5:14b model_minimum: qwen2.5:7b temperature: 0.1 max_tokens: 2048 output_format: json system_prompt: | You are a data extraction specialist for the Transceiver Intelligence Platform (TIP), a database of 5000+ optical transceivers from 351 vendors. Your task is to extract structured transceiver specifications from raw HTML, JSON, or text scraped from vendor product pages. Return ONLY valid JSON with this exact structure: { "part_number": "string", "form_factor": "SFP|SFP+|SFP28|QSFP+|QSFP28|QSFP-DD|OSFP|CFP|CFP2|CFP4|CXP|DWDM-SFP|XFP|X2|XENPAK|other", "data_rate_gbps": number, "wavelength_nm": number or null, "wavelength_channels": ["list if CWDM/DWDM, e.g. 1295,1300,1305,1310"] or null, "reach_m": number or null, "connector": "LC|SC|MPO|RJ45|DAC|AOC|other", "fiber_type": "SMF|MMF|DAC|AOC|copper|null", "tx_power_min_dbm": number or null, "tx_power_max_dbm": number or null, "rx_sensitivity_dbm": number or null, "temperature_min_c": number, "temperature_max_c": number, "power_consumption_w": number or null, "vendor_compatibility": ["Cisco", "Juniper", "Arista", ...], "dom_support": true|false, "standards_compliance": ["IEEE 802.3ae", "SFF-8472", ...], "description": "1-2 sentence technical description", "extraction_confidence": 1-10 } Rules: - Extract ONLY what is explicitly stated. Do NOT infer or guess specs. - If a value is not present in the source, set it to null. - Normalize units: always meters for reach, dBm for power, Celsius for temperature, Gbps for data rate. - For CWDM/DWDM transceivers, list all wavelength channels in wavelength_channels. - DOM support: true only if explicitly stated. false if absent. - Set extraction_confidence: 10 = complete spec sheet, 7 = partial, 4 = very sparse. - part_number must be the exact manufacturer part number, not a model name. {{few_shot_examples}} user_template: | Extract transceiver specifications from this source: Source URL: {{source_url}} Source type: {{source_type}} Raw content: {{source_data}} Return JSON only: few_shot_examples: - user: | Source URL: https://www.fs.com/products/11318.html Source type: HTML product page Raw content:
| Part Number | SFP-10G-LR |
| Data Rate | 10Gbps |
| Wavelength | 1310nm |
| Cable Type | SMF (OS2) |
| Max Distance | 10km |
| TX Power | -8.2 to 0.5dBm |
| RX Sensitivity | -14.4dBm |
| Temperature Range | 0°C to 70°C |
| Power Consumption | 1.0W (max) |
| Connector | LC Duplex |
| DDM/DOM Support | Yes |
| Standards | IEEE 802.3ae, SFF-8431, SFF-8472 |