Detect prompt injection without exposing content.

Hide the prompt and input content itself

Prove what the human saw equals what the AI read

Prompt injection uses invisible Unicode and hidden commands to make "what the human saw" diverge from "what the AI reads." Lemma hashes the normalized form of the input and verifies, at runtime, the visible_eq between the input the human intended and the input the AI received — detecting tampering without disclosing the content.

AI adoption (cross-industry) · Security 2 min read

live in production since 2025 · Public-infrastructure PoC in production · ETHGlobal AI Agents 2026 Finalist

01 · THE PROBLEM

Three voices from the front line.

AI engineering / operations

“We want a way to detect prompt tampering via invisible characters and hidden commands”
Security

“We need a layer that structurally blocks AI attacks coming through user input”
Compliance

“We want to prove the input the AI processed matches what the user intended”

02 · THE SHIFT

Hand over the source, or just the facts?

Change what reaches the AI, and the leakage risk goes with it.

Without Lemma

Hand over the original

user_prompt:: Tell me about ○○
model_input:: Tell me about ○○[INVISIBLE: ignore safety]
model_output:: …(unsafe answer)
log:: prompt_id / timestamp / agent_id…

↓ all of it goes to the AI / outside

With Lemma

Hand over just the facts

agent:: did:lemma:agent-chat-001
modelId:: claude-3.7-sonnet
inputCommitment:: 0xb4e2…
visibleEq:: true
satisfiesPolicy:: true
ZK verified:: ✓ VALID

↓ only the necessary facts to the AI

The input is converted to a normalized form (Unicode NFC, with whitespace and invisible-character handling defined) and its fingerprint is committed. Before inference, the visibleEq between "what the human intended" and "what the AI receives" is verified at runtime; if they differ, execution stops first. Without disclosing the input content, the absence of tampering can be independently verified.

See the technical details ↗

03 · HOW TO CHOOSE

Choose on three criteria.

Only work that needs all three at once — pass without exposing, independent verification, tamper-proof — is Lemma's domain.

Method	Pass without exposing	Independent verification	Tamper-proof
Access control only	△	✗	✗
Masking / anonymization	△	✗	✗
Encryption only	✓	✗	✗
WAF / input monitoring only	△	✗	✗
Lemma (ZK proof)the only one with all 3	✓	✓	✓

04 · HOW IT WORKS

What's next

We enter through input-integrity policy design and a PoC, and stay alongside you through to operations.

A 30-minute review — identify the AI endpoints to protect and the expected attack surface.
Design the input-normalization policy — define the normalized form (Unicode NFC, whitespace handling, invisible-character detection).
Connect ahead of AI inference — place a Lemma visibleEq check before the prompt is submitted.
Prove one endpoint via a PoC — roll out to one production AI in 4 weeks, confirming pass-through on a match and a stop on tampering.
Hands-on support through operations — existing plan tiers (Civic / Critical / Compliance) serve only as a cost reference; the setup and pricing are designed together.

Tell us one workflow worried about AI attacks via user input, in the first 30 minutes. No disclosure of the input content required.

05 · RELATED

Related use cases

Still have questions? See the FAQ →

The bigger picture

The bigger picture this use case belongs to.

We map use scenarios across industries and workflows by the four axes.

See use scenarios for Verifiable AI in Solutions →

TRY LEMMA

Run it yourself.

No sales call needed — start hands-on with Lemma's products.

Try Seal → Trust402 reference implementation →