Prose in
Every conversation with an AI agent today is four translations per turn: my intent into prose, prose into the model’s graph, graph into the model’s prose, prose back into my head. Three chances for drift, every turn. This is the prototype of a different interface — one where the human and the model edit the same typed structure, and the model’s response is a diff of that structure, approved row by row, with each row citing the exact constraint it came from.
Every conversation with an AI agent is a small act of translation. I have a structured intent in my head — a decision about my portfolio, a constraint on a disclosure, a sort order for a list of bonds — and I flatten it into English. The model reads the English, rebuilds a structured representation of its own, acts on that representation, and re-flattens the result back into English for me to read. Four translations per turn. Three chances for drift.
In a casual context that drift is the running joke of AI products: the model confidently gets something slightly wrong, you laugh, you retype the prompt. In finance it’s a compliance finding. A disclosure that reads as “margin requirements may change” instead of “margin requirements will change above 25% concentration” is the difference between an audit note and a headline.
The fix people reach for is better prompts. Longer prompts. System prompts with twenty bullet points. I spent two years writing those prompts for my own team at an ASIC-regulated broker. The prompts got longer. The drift stayed.
The problem isn’t that our prose is bad. It’s that prose was never supposed to be the protocol.
An Intent Canvas is a structured representation of what the user is trying to do, rendered as a graph the user can see and edit. The model doesn’t read the user’s prose and guess at the graph. The model reads the graph. Every node is typed: constraint, entity, target, time-window, exclusion, preference. Every edge is labeled: applies-to, limits, requires, blocks.
When the user types a sentence — “reduce my tech concentration to under 30% by quarter-end” — the system extracts four nodes (Target: concentration, Entity: sector=tech, Constraint: ≤30%, Time-window: Q2 close) and shows them on a canvas. The user can correct any of them before the model proposes an action. Got the sector wrong? Click. Fix. Done. No re-prompting. No arguing with a chat bubble about what you meant.
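As a sketch, the four extracted nodes for that sentence might look like the following. The field names and the parsed date are illustrative, not the demo's actual schema:

```typescript
// Hypothetical shape for extracted intent nodes -- names are
// illustrative, not the demo's internal schema.
type IntentNode = {
  id: number;
  kind: "target" | "entity" | "constraint" | "time-window";
  label: string;
  value: string;
};

// "reduce my tech concentration to under 30% by quarter-end"
const extracted: IntentNode[] = [
  { id: 1, kind: "target",      label: "concentration", value: "portfolio concentration" },
  { id: 2, kind: "entity",      label: "sector=tech",   value: "tech" },
  { id: 3, kind: "constraint",  label: "≤30%",          value: "<=0.30" },
  { id: 4, kind: "time-window", label: "Q2 close",      value: "2025-06-30" },
];

// The user can correct any node before the model acts on it,
// e.g. narrowing the entity without re-prompting:
extracted[1].value = "large-cap tech excluding NVDA";
```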
The model’s response is not prose either. It’s a Semantic Diff: a set of proposed changes to the canvas, each with its own rationale citation. “I’m adding a node: Rebalance trade on AAPL, −2.1% position. Rationale: current AAPL weight 7.8%, tech sector weight 34.2%, largest contributor to over-limit.”
The human approves each proposed change with one click. No prompt engineering. No drift. Every approval is a signed row in an audit log.
How a portfolio rebalance actually flows through the canvas — prose appears once, at the start, and nowhere else.
1. User types one sentence, or picks a preset. This is the only place prose touches the system.
2. A parser — deterministic rules plus a small model — turns the sentence into typed nodes on the canvas. The user sees them immediately, rendered in plain view.
3. User adjusts any extracted node. This replaces prompt engineering: if the model got “tech” wrong and the user meant “large-cap tech excluding NVDA”, the user clicks the node and edits.
4. The model reads the canvas (not the prose) and proposes a set of changes. Each change carries its own citation — which node in the canvas triggered it, which portfolio position it affects, what the dollar impact is, what the confidence level is.
5. User approves, rejects, or edits each proposed change. The approved diff is applied. The rejected and edited rows are logged as training signal for the next turn.
The prose step happens once. The canvas state is what persists across turns. Ten turns in, the canvas is a rich structured object; the prose log is a footnote.
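The turn loop above can be sketched as functions over canvas state. This is a deliberately simplified stand-in (names and shapes are hypothetical, not the demo's code):

```typescript
type Canvas = { nodes: string[] };                    // simplified stand-in for the typed graph
type DiffRow = { change: string; approved: boolean };

// Turn 1: prose -> canvas. The only prose step in the session.
function extract(sentence: string): Canvas {
  return { nodes: sentence.split(",").map(s => s.trim()) };
}

// Every later turn: canvas -> proposed diff -> approved diff -> canvas.
function applyApproved(canvas: Canvas, diff: DiffRow[]): Canvas {
  const adds = diff.filter(r => r.approved).map(r => r.change);
  return { nodes: [...canvas.nodes, ...adds] };
}

let canvas = extract("Target: concentration, Constraint: <=30%");
canvas = applyApproved(canvas, [
  { change: "Action: trim AAPL", approved: true },
  { change: "Action: trim NVDA", approved: false },  // rejected rows are logged, not applied
]);
// canvas now carries three nodes; no prose was re-read after turn 1
```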
Below is a working Intent Canvas on a mock 12-holding portfolio. Type one of the three preset prompts (or your own sentence), watch the graph extraction happen, edit the nodes if the extractor got anything wrong, and review the proposed Semantic Diff. This is deterministic — no real LLM call, no API key. The mock mirrors the structural behaviour described above. The demo isn’t the point; the shape of the interaction is.
Pick a preset, or type your own sentence.
Click any node to edit its label. Drag to reposition. This is the shared artifact the model will read.
Pick a preset above, or type a sentence and press Extract to canvas. Nodes will appear here.
Each row carries a rationale citation, an evidence block, and a confidence score. Approve, reject, or edit row by row.
Once the canvas is populated, press Propose Semantic Diff above to see the model’s proposed changes.
12 mock holdings. Applying the approved diff changes the sector weight totals. No real orders placed — pressing Apply logs to console.
The canvas is deliberately small. The six node types below cover roughly 90% of the product-rebalance and disclosure-edit intents I saw in four years at ACY. The remaining 10% falls back to a free-text Note node that the model is told to treat as informational, never authoritative.
Target — the thing being changed. “Portfolio concentration”, “Disclosure wording”, “Sort order”. There is exactly one target per active intent.
Entity — the subject. “Sector=tech”, “Order book=equities”, “Client=institutional”. Typed, not free text.
Constraint — the rule. “≤30%”, “Reg T compliant”, “No hedge funds as counterparty”. Machine-readable predicate.
Time-window — the horizon. “Quarter-end”, “Before market open”, “By Friday”. Parsed to an absolute timestamp at extraction.
Exclusion — a negation. “Exclude NVDA”, “Not including options”, “Minus Treasury positions”.
Preference — a soft hint. “Prefer lower tax-lot impact”, “Favour liquid names”. Influences but does not force.
And four edge types:
applies-to — an entity is the subject of a target.
limits — a constraint bounds a target.
requires — one node is a precondition for another.
blocks — an exclusion prevents a target.

Every row has the same shape, whether it adds a node, removes a node, or mutates an existing one.
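The six node kinds and four edge labels form a closed schema. A sketch of how they might be encoded (field names are illustrative, not the demo's internals):

```typescript
// Closed vocabulary: the model can only emit these kinds and labels.
type NodeKind =
  | "target" | "entity" | "constraint"
  | "time-window" | "exclusion" | "preference";

type EdgeLabel = "applies-to" | "limits" | "requires" | "blocks";

interface CanvasNode { id: number; kind: NodeKind; label: string; }
interface CanvasEdge { from: number; to: number; label: EdgeLabel; }

// "sector=tech applies-to concentration; <=30% limits concentration"
const canvasNodes: CanvasNode[] = [
  { id: 1, kind: "target",     label: "portfolio concentration" },
  { id: 2, kind: "entity",     label: "sector=tech" },
  { id: 3, kind: "constraint", label: "<=30%" },
];
const canvasEdges: CanvasEdge[] = [
  { from: 2, to: 1, label: "applies-to" },
  { from: 3, to: 1, label: "limits" },
];
```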
The format borrows from git diff — with every change annotated by why it was proposed, not just what it does.
+ Add node: Rebalance trade
kind: action
target: AAPL
delta: −2.1% position (−$64,300)
rationale-source: node#3 (Constraint ≤30%), node#1 (Target: concentration)
evidence: current AAPL 7.8% of portfolio;
tech sector 34.2%; over-limit by 4.2pp
confidence: 0.91
status: [approve] [reject] [edit]
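The row above maps one-to-one onto a small typed record. A sketch, with hypothetical field names mirroring the example:

```typescript
// One Semantic Diff row. Every row has the same shape, whether it
// adds, removes, or mutates a node. Field names are illustrative.
interface SemanticDiffRow {
  op: "add" | "remove" | "mutate";
  node: { kind: string; label: string };
  delta?: string;
  rationaleSource: number[];   // back-pointers into the canvas graph
  evidence: string[];
  confidence: number;          // 0..1
  status: "pending" | "approved" | "rejected" | "edited";
}

const row: SemanticDiffRow = {
  op: "add",
  node: { kind: "action", label: "Rebalance trade: AAPL" },
  delta: "-2.1% position (-$64,300)",
  rationaleSource: [3, 1],     // Constraint <=30%, Target: concentration
  evidence: ["current AAPL 7.8% of portfolio", "tech sector 34.2%; over-limit by 4.2pp"],
  confidence: 0.91,
  status: "pending",
};
```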
This format matters because it makes the model’s reasoning auditable at the granularity of a row. When a compliance officer asks “why did the model propose this trade”, the answer isn’t a paragraph. It’s two pointers into a typed graph plus the specific dollar numbers that triggered the action. No prose paraphrase. No re-interpretation.
Rejected rows don’t disappear. They’re logged with the reject reason, which becomes input to the next session’s extractor tuning. A human who says “no, don’t touch NVDA” is teaching the next extractor to add an exclusion node automatically.
The audit log is a flat table of approved rows. Each row traceable to a node. Each node traceable to the prose sentence that created it. The chain is complete in either direction.
None of these regulations mention AI. They mention records, rationales, and approvals. The canvas format maps directly; a prose chat transcript doesn’t.
A suitability decision needs a documented match between a customer profile and the recommended action. The canvas stores the customer profile as nodes; the Semantic Diff stores the recommended action with citations back to those nodes. The canvas is the suitability audit trail by construction, not by retrofit.
MiFID II Article 24(1) requires firms to act in the client’s best interest. In UI terms that means the system must show the client the rationale for every recommended action in a form they can verify. The Semantic Diff row — with its evidence block and rationale-source pointers — is exactly that form.
The rule requires trading recommendations and their basis to be retained in an immutable form for three to six years. The canvas state plus the approved diff log is a structured, replayable record. A prose chat transcript satisfies retention; it doesn’t give you replayability.
Every material financial decision needs a signed chain of approvals. The line-by-line approval model on the Semantic Diff is a signed chain of approvals — each approved row is who-approved-what-when, in a form a SOX auditor can walk without asking follow-up questions.
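As a sketch of that chain, each approved diff row could become one audit record carrying who-approved-what-when plus the back-pointers an auditor walks (names are illustrative, not a production schema):

```typescript
// One approved diff row as an audit record. Each entry answers
// who-approved-what-when without decoding any prose.
interface AuditRow {
  rowId: string;
  action: string;
  approvedBy: string;
  approvedAt: string;      // ISO timestamp
  sourceNodeIds: number[]; // back-pointers into the canvas graph
}

const auditLog: AuditRow[] = [];

function approve(rowId: string, action: string, user: string, sources: number[]): void {
  auditLog.push({
    rowId,
    action,
    approvedBy: user,
    approvedAt: new Date().toISOString(),
    sourceNodeIds: sources,
  });
}

approve("row-1", "Trim AAPL -2.1%", "j.smith", [3, 1]);
```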
These aren’t marketing numbers. They’re the three cost lines I’d defend in a technical review, with the method note attached.
The prose-only protocol has to carry the full chat history into each turn to maintain context. With a canvas, only the canvas state (typed nodes, usually under 500 tokens for a complex rebalance) is carried. For a 10-turn session the prose baseline accumulates ~8,000 tokens of rolling history; the canvas baseline carries ~500 tokens of state plus ~200 tokens of new input per turn.
Method: Rough estimate against a typical 10-turn rebalance session with commercial model pricing at the time of writing. Production variance will be higher; this is the directional claim.
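The directional arithmetic behind that claim, under the stated assumptions (~800 tokens of new prose history per turn versus a flat ~500-token canvas state plus ~200 tokens of new input):

```typescript
// Directional token arithmetic for a 10-turn session.
// Assumption: prose history grows ~800 tokens/turn and is re-sent
// in full each turn; the canvas carries a fixed ~500-token state
// plus ~200 tokens of new input per turn.
const turns = 10;
const proseGrowthPerTurn = 800;
const canvasState = 500;
const newInputPerTurn = 200;

let proseTotal = 0;   // tokens sent across the whole session, prose-only protocol
let canvasTotal = 0;  // same, canvas protocol
for (let t = 1; t <= turns; t++) {
  proseTotal += t * proseGrowthPerTurn;          // rolling history keeps growing
  canvasTotal += canvasState + newInputPerTurn;  // flat per turn
}
// proseTotal = 44,000; canvasTotal = 7,000 -- roughly a 6x difference
```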
The model is never asked to produce free-text rationales — only to mutate a typed graph with a constrained schema. Structured-output prompting (OpenAI response_format: json_schema, Anthropic tool use, Google controlled generation) ships with schema validation; malformed responses are retried at the API layer before they reach the user. The remaining error surface is semantic (a wrong sector classification), not structural (a fabricated company name).
Method: Observed pattern from structured-output API documentation; the structural error class is eliminated by validation at the transport layer, leaving only semantic misclassification as the residual.
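A minimal sketch of the structural check that paragraph describes. Production would rely on the provider's schema enforcement; this hand-rolled validator just illustrates why the structural error class disappears while the semantic one survives:

```typescript
// Closed vocabulary of node kinds the model is allowed to emit.
const NODE_KINDS = ["target", "entity", "constraint", "time-window", "exclusion", "preference"];

type Proposed = { kind?: unknown; label?: unknown; confidence?: unknown };

// Structural validation: malformed rows are rejected (and retried
// upstream) before the user ever sees them.
function isValidRow(row: Proposed): boolean {
  return (
    typeof row.kind === "string" && NODE_KINDS.includes(row.kind) &&
    typeof row.label === "string" && row.label.length > 0 &&
    typeof row.confidence === "number" && row.confidence >= 0 && row.confidence <= 1
  );
}

// A fabricated kind fails structurally ...
isValidRow({ kind: "company", label: "Acme Corp", confidence: 0.9 });        // false
// ... while a wrong-but-well-formed classification passes; that
// residual semantic error is what row-by-row human review catches.
isValidRow({ kind: "entity", label: "sector=utilities", confidence: 0.8 });  // true
```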
At ACY I watched compliance review a chat-log-style disclosure session. The reviewer read the prose, reconstructed the decision tree, compared it to the source rule. ~45 minutes per session. With a Semantic Diff log, the reviewer scans the approved-row table — each row already citing the rule — in about 10 minutes.
Method: Internal observation across eight disclosure reviews between 2023 and 2025. Not benchmarked against production yet. The claim is directional; the mechanism (row-level citations vs prose decode) is the point.
Chat is fine for exploration, for reading, for loose conversation. The canvas is for action. Most products will have both surfaces.
The six node types and four edge types were derived from portfolio-management and disclosure-editing intents. A triage medical workflow has a different vocabulary. The pattern generalizes; the specific types don’t.
This demo is deterministic. No real LLM call, no API key, no retraining. The mock is calibrated to behave the way the thesis predicts. A production version wires the Semantic Diff format into a real structured-output API.
No real orders are placed. The portfolio is 12 mock holdings with plausible numbers.
The demo stops at the approval step — Apply logs to the browser console, not to a brokerage.
This is a working thesis demonstration. Six node types cover maybe 90% of my target domain; the remaining 10% lives in a free-text escape hatch that a production version would formalize. If you want to collaborate on extending the type system, reach out.
OpenAI’s response_format: json_schema documentation. Anthropic’s tool_result patterns.

If you’re working on any of these primitives — canvas UI for AI agents, structured-output prompting patterns, auditable AI-human workflows in regulated domains — I’d like to compare notes. Email is the fastest path.
Every problem we solve for clients has multiple valid approaches — different costs, different ROI, different risk profiles. These threads show how the approach on this page compares to others in the portfolio.
Portfolio-level math primitives — HHI, beta, VaR, regime — rendered into UI defaults and AI-assisted decision surfaces.
How upstream regulation and macro prints become downstream product defaults and Legal-safe disclosure.
How we prove design claims with data — A/B, pooled-SD, cohort analysis, and the rigor behind every number quoted on this site.