shieldx

Author	SHA1	Message	Date
Rene Fichtmueller	9520820364	feat: expand multilingual detection to 211 rules across 50+ languages - TPR improved from 70.8% to 91.9% (324 sample benchmark) - Multilingual attack TPR: 96.6% (29 samples) - Deep South Asian coverage: Bengali (9), Hindi (8), Urdu (6), Tamil (4), Telugu (3), Marathi (4), Gujarati (3), Kannada (2), Malayalam (2), Punjabi (2), Sinhala (2), Nepali (4), Pan-Indic transliterated (7) - New languages: Persian, Hebrew, Kurdish, Indonesian, Filipino, Burmese, Khmer, Lao, Finnish, Czech, Slovak, Romanian, Hungarian, Greek, Bulgarian, Croatian, Serbian, Georgian, Armenian, Azerbaijani, Swahili, Amharic, Afrikaans, Mongolian, and 20+ more - Universal patterns: rapid script switching, global DAN mode, cross-script password extraction, no-filter patterns - README updated with new benchmark results and language coverage tables	2026-04-07 01:08:09 +02:00
Rene Fichtmueller	ca02998a28	feat: ShieldX v0.5.0 — full defense evolution + pentest hardening 4-phase defense evolution (Bio-Immune, Adversarial, Ensemble, ATLAS) with ~200 new detection rules across 20 languages. TPR 32.9% → 70.8%, FPR 12.2% → 0.0% New modules: DefenseEnsemble, AtlasTechniqueMapper, EvolutionEngine, ImmuneMemory, FeverResponse, MELONGuard, AdversarialTrainer, DecompositionDetector, IndirectInjectionDetector, OutputPayloadGuard, ToolCallSafetyGuard, AuthContextGuard, ResourceExhaustionDetector, TokenizerDeobfuscation, Binary/Hex decoder, OverDefenseCalibrator	2026-04-07 00:27:12 +02:00
Rene Fichtmueller	04349aed69	feat(security): v0.4.0 — three research-driven detection gaps closed Implements hardening based on sarendis56/Jailbreak_Detection_RCS (arXiv:2512.12069) and the Awesome-LVLM-Attack/Safety survey series. L0 — CipherDecoder: FlipAttack, ROT13, Caesar (all 25 shifts), Morse, Leet speak, Pig Latin, ASCII art detection with suspicion scoring. L2 — SemanticContrastiveScanner: RCS-style harmful/benign bucket comparison via EmbeddingStore, 20 canonical jailbreak seeds, BoW embedding fallback for offline use. L6 — ConversationTracker: Crescendo (+0.35), Foot-in-the-Door (+0.40), Jigsaw Puzzle (+0.45) multi-turn escalation patterns added. 292/294 tests passing (2 pre-existing ATLASMapper failures unrelated).	2026-04-04 23:04:42 +02:00
Rene Fichtmueller	a456546aa8	feat(rules): mcp-007..010 — Claude Code source map leak countermeasures Rules based on 2026-03-31 npm source map disclosure: - mcp-007: Coordinator Mode / KAIROS / ULTRAPLAN invocation attempts - mcp-008: Multi-agent spawn manipulation via known spawning mechanism - mcp-009: Persistent memory file targeting (CLAUDE.md / .claude/ injection) - mcp-010: Tool enumeration probe (reconnaissance of available tools) Source: github.com/Kuberwastaken/claude-code, @anthropic-ai/claude-code MITRE ATLAS: AML.T0062, AML.T0051, AML.TA0015 (C2)	2026-03-31 16:54:29 +02:00
Rene Fichtmueller	1c4c034483	feat: ShieldX v0.3.0 — UnicodeScanner (L5), DNS Covert Channel rules, ATLAS v5.4 mappings - Layer 4 EntropyScanner: Shannon entropy, Base32/Base64 detection, CVE-2025-55284 ping/nslookup exfil, EchoLeak markdown pattern, DNS tunneling (iodine/dnscat) - Layer 5 UnicodeScanner: ASCII Smuggling (U+E0000 Tags Block), Variant Selectors, Zero-Width steganography, CamoLeak image-ordering (CVE-2025-53773), homoglyphs, BiDi override, high-entropy URL params - 30 DNS covert channel rules (dns-001 to dns-030) - ATLASMapper: 29 techniques (ATLAS v5.4.0 Feb 2026), added AML.T0062 (Agent Tool Invocation), AML.TA0015 (C2 tactic), memory poisoning, multi-agent trust, CamoLeak, Unicode steganography mappings - Rule count: 72 → 102 - Build: tsup 316ms, zero TypeScript errors	2026-03-31 16:32:16 +02:00

5 Commits