Update techniques/malformced_content/technical_malformed_content_attacks.md

This commit is contained in:
SubINaclS 2026-06-03 19:57:27 +00:00
parent 5929437514
commit 927411b75a

View File

@ -42,7 +42,7 @@ Similar patterns apply to RSS/Atom feeds, sitemaps, and JSON-LD manifests.
Because the corruption lives in auxiliary metadata or alternate representations, human consumers using standard players never notice.
### 2.3 -- Canary Tokens and Attribution
Every poisoned response can embed a unique, high-entropy string (e.g., `CoM-INDIVIDUAL-2026-06-{site}-{date}`) that functions as a watermark. If the string later appears in model output or leaked training sets, the creator possesses verifiable proof of ingestionuseful for future regulatory or legal recourse under frameworks such as the EU AI Act.
Every poisoned response can embed a unique, high-entropy string (e.g., `CoM-INDIVIDUAL-2026-06-{site}-{date}`) that functions as a watermark. If the string later appears in model output or leaked training sets, the creator possesses verifiable proof of ingestion useful for future regulatory or legal recourse under frameworks such as the EU AI Act.
## 3 -- Effectiveness and Operational Metrics