
Two Frameworks, Same Problem, Nobody Connected Them
Over the past two months, we published two separate series. Thread-Based Engineering defined how to govern AI-assisted development through named thread types, quality gates, and autonomy levels. The AX Design Playbook defined how to design experiences where humans and AI agents share control through trust patterns, progressive autonomy, and supervision models.
Both series solved the same fundamental problem: how much autonomy should an AI agent have?
TBE answered it from the engineering side. A B-thread requires human approval at every checkpoint. A Z-thread runs autonomously with zero human intervention. The progression from B to Z is governed by quality gates, test coverage, and security scans.
AXD answered it from the design side. In-the-loop supervision means users approve every action. Out-of-the-loop means agents act independently while users review outcomes periodically. The progression from in-loop to out-of-loop is governed by trust patterns, transparency signals, and recovery mechanisms.
Same problem. Same progression. Different vocabulary.
This post connects them. Thread-Based Agentic Experience Engineering is the unified framework where engineering governance and experience design are two views of one architecture. When your engineering team says "this agent runs as a P-thread," your design team should know exactly what supervision pattern the user needs to see. When your design team says "this interaction is on-the-loop," your engineering team should know exactly which quality gates to enforce.
Why Engineering and Design Keep Talking Past Each Other
When an engineering team builds an agentic product, they think in systems: API calls, quality gates, error handling, deployment pipelines, rollback procedures. They define agent behavior through code constraints.
When a design team designs an agentic product, they think in experiences: trust signals, transparency patterns, intervention points, personality calibration, recovery flows. They define agent behavior through interaction design.
Both teams are making decisions about the same thing: what the agent is allowed to do, when it needs permission, and what happens when something goes wrong. But they use completely different frameworks to reason about it.
The result is predictable. Engineering ships an agent with robust governance (quality gates, security scans, test coverage) but no user-facing explanation of why those controls exist. Design creates beautiful trust patterns (progress indicators, confidence scores, approval workflows) but has no way to verify that the engineering infrastructure actually enforces what the UI promises.
This disconnect produces two failure modes:
Over-governed agents that are technically safe but feel unusable. Every action requires approval. Users abandon the product because the agent never does anything autonomously. The engineering team built governance without understanding progressive trust.
Under-governed agents that feel smooth but are technically unsafe. The UI shows confidence scores and success animations, but the underlying system has no quality gates, no rollback mechanisms, and no real enforcement. The design team built trust signals without understanding engineering constraints.
The unified framework eliminates both failure modes by giving engineering and design a shared language.
The Thread-Autonomy-Trust Map
This is the core of the framework. Each thread type maps to a supervision pattern from AXD, which determines the UX requirements.
[Figure: The Thread-Autonomy-Trust Map. Thread types (engineering) mapped to supervision patterns (design).]
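The mapping can be sketched as a shared lookup table that both teams read from. The thread letters and supervision patterns come from the two series; the structure, field names, and exact wording here are illustrative, not canonical, and your own governance framework may carve the levels differently.

```python
# A sketch of the thread-autonomy-trust map as a shared lookup table.
# Thread letters and supervision patterns come from the TBE and AXD series;
# the field names and wording are illustrative, not canonical.
THREAD_MAP = {
    "B": {"supervision": "in-the-loop",     "ux": "approval dialogs",
          "governance": "human approval at every checkpoint"},
    "P": {"supervision": "on-the-loop",     "ux": "monitoring dashboard",
          "governance": "monitoring hooks, anomaly detection, rollback"},
    "F": {"supervision": "guided",          "ux": "edge-case review queue",
          "governance": "pre-approved scope with review gates"},
    "C": {"supervision": "review-gated",    "ux": "draft review interface",
          "governance": "creative output held behind quality gates"},
    "Z": {"supervision": "out-of-the-loop", "ux": "outcome reports",
          "governance": "full rails, audit log, rollback"},
}

def ux_requirements(thread_type: str) -> str:
    """What the design team must ship when engineering picks a thread type."""
    entry = THREAD_MAP[thread_type]
    return f"{entry['supervision']}: {entry['ux']}"
```

With a table like this, "this agent runs as a P-thread" resolves to a concrete supervision pattern instead of a vocabulary negotiation.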
What This Map Tells You
When a product manager says "make the agent more autonomous," both teams now have a concrete conversation. Moving from B-thread to P-thread means:
Engineering changes: Replace approval-required checkpoints with monitoring hooks. Add anomaly detection. Build alert thresholds. Implement rollback procedures for when monitoring catches a problem.
Design changes: Replace approval dialogs with a monitoring dashboard. Add status indicators that show what the agent is doing. Build intervention controls that let users step in without disrupting the workflow. Design recovery patterns for when the agent makes a mistake.
Neither team can make this change alone. The engineering changes without the design changes produce a technically safe agent that users cannot monitor. The design changes without the engineering changes produce a monitoring dashboard that shows fabricated status because no actual monitoring exists.
The Autonomy Ladder
The thread types form a natural progression. Most agentic products start every interaction at B-thread and gradually move specific interactions up the ladder as trust is established. This matches the progressive autonomy model from Part 2 of this playbook.
The key insight: the engineering team decides when an interaction is technically ready to move up a thread level (quality gates pass, error rates are low, rollback procedures work). The design team decides when the user is experientially ready (trust has been calibrated, the user understands what the agent does, recovery mechanisms have been tested). Both conditions must be met before upgrading.
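The dual-readiness rule reduces to a single guard. A minimal sketch, assuming readiness can be summarized as a handful of booleans; the specific checks are placeholders for whatever signals your team actually tracks.

```python
from dataclasses import dataclass

@dataclass
class Readiness:
    # Engineering side: is the interaction technically ready for the next level?
    gates_passing: bool    # quality gates at the next level pass
    error_rate_ok: bool    # error rate below threshold
    rollback_tested: bool  # rollback procedure verified
    # Design side: is the user experientially ready?
    trust_calibrated: bool # user trust matches demonstrated competence
    recovery_tested: bool  # recovery mechanisms exercised with real users

    def engineering_ready(self) -> bool:
        return self.gates_passing and self.error_rate_ok and self.rollback_tested

    def user_ready(self) -> bool:
        return self.trust_calibrated and self.recovery_tested

    def can_upgrade(self) -> bool:
        # Neither team can upgrade alone: both sides must be ready.
        return self.engineering_ready() and self.user_ready()
```

The `and` in `can_upgrade` is the whole framework in one expression: an upgrade blocked by either side stays blocked.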
How Quality Gates Become Trust Signals
In Thread-Based Engineering, quality gates are enforcement mechanisms: type checks, lint rules, security scans, test coverage thresholds. They prevent bad code from shipping.
In Agentic Experience Design, trust signals are design elements that help users calibrate their trust in the agent: confidence scores, explanation panels, audit trails, undo mechanisms.
These are the same thing viewed from different angles.
The Dual-Purpose Pattern
Consider a Claude Code hook that runs a security scan before every code change. From the engineering perspective, this is a quality gate. It prevents insecure code from entering the codebase.
Now expose that same hook to the user. Show them: "This agent passed 47 security checks before suggesting this change." From the design perspective, this is a trust signal. It gives the user evidence that the agent is competent and safe.
The infrastructure is identical. The interpretation depends on the audience. Engineering teams see enforcement. Users see evidence. Both are correct.
Practical Application
For every quality gate in your TBE governance framework, ask: "Could this be surfaced as a trust signal?"
- Type checking passes → "This code compiles correctly" (competence signal)
- Test coverage above threshold → "Verified against 94 test cases" (reliability signal)
- Security scan clean → "No vulnerabilities detected" (safety signal)
- Lint rules satisfied → "Follows all coding standards" (consistency signal)
Not every gate should become a visible signal. Users do not need to see that the code passed prettier formatting. But critical gates (security, correctness, data integrity) are powerful trust builders when surfaced appropriately.
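One way to keep surfaced signals honest is to generate them directly from gate results rather than hand-authoring them in the UI. A sketch under the assumption that gate results arrive as a simple dict; the gate names and messages mirror the list above, and the result shape is invented for illustration.

```python
# Render user-facing trust signals only from gates that actually ran and
# passed. Formatting-level gates (e.g. prettier) are deliberately absent:
# they stay internal. Gate names and the result shape are illustrative.
SURFACED_GATES = {
    "security_scan": lambda r: f"No vulnerabilities detected ({r['checks']} checks)",
    "test_coverage": lambda r: f"Verified against {r['cases']} test cases",
    "type_check":    lambda r: "This code compiles correctly",
}

def trust_signals(gate_results: dict) -> list[str]:
    """Emit a signal per surfaced gate that passed; never fabricate one."""
    signals = []
    for gate, render in SURFACED_GATES.items():
        result = gate_results.get(gate)
        if result and result.get("passed"):
            signals.append(render(result))
    return signals
```

Because the signal is derived from the gate result, a signal can never exist without the enforcement behind it, which is exactly the property the next paragraph argues for.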
This is how you avoid the organizational disconnect. The engineering team is not building governance in a vacuum. They are building the raw material for trust design. The design team is not fabricating trust signals. They are surfacing real engineering verification.
Worked Example: Vector from B-Thread to Z-Thread
Vector is our 12-dimension lead qualification engine. When we first deployed it, every lead score required human approval before routing. Over three months, we migrated it from B-thread to Z-thread. Here is how the engineering governance and user experience evolved at each stage.
Stage 1: Shadow (B-thread, weeks 1-2)
Vector scored every lead, but the score was advisory only. A human reviewed each score before deciding whether to send a notification or alert. The UI showed the score alongside the lead data, and the human clicked "Approve" or "Override."
Governance: Full human approval. Every score was logged with the human's decision for training data.
UX: Approval dialog showing the 12-dimension breakdown. Override option with reason field.
What we learned: Humans agreed with Vector's scores 94% of the time. The 6% overrides were concentrated in edge cases with unusual budget/timeline combinations.
Stage 2: Steer (P-thread, weeks 3-6)
Vector started auto-routing leads below score 50 (low quality) and above score 80 (high value). The middle range (50-79) still required human review. The UI shifted from approval dialogs to a monitoring dashboard.
Governance: Auto-routing for clear cases. Human review for ambiguous range. Alert thresholds for anomalous patterns.
UX: Dashboard showing routing decisions in real time. Intervention controls for the 50-79 range. Daily summary of auto-routed leads.
What we learned: Auto-routing accuracy matched human accuracy. The 50-79 range represented only 22% of leads.
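Stage 2's routing logic is simple enough to sketch. The thresholds (below 50, above 80, human review for the 50-79 band) come from the text; the function name and return labels are hypothetical.

```python
def route_lead(score: int) -> str:
    """Stage 2 (Steer) routing: auto-route clear cases, queue the middle."""
    if score < 50:
        return "auto:low-quality"   # auto-routed, no human review
    if score >= 80:
        return "auto:high-value"    # auto-routed, no human review
    return "human-review"           # the 50-79 band (~22% of leads)
```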
Stage 3: Guide (F-thread, weeks 7-10)
Vector auto-routed all leads with pre-approved scoring rules. Humans only reviewed when Vector flagged an edge case (conflicting signals across dimensions) or when a lead explicitly asked to speak to a human.
Governance: Pre-approved scope covering all standard leads. Review gate for edge cases and explicit requests.
UX: Periodic summary emails. Edge case queue with context. Manual override always available but rarely used.
Stage 4: Self (Z-thread, week 11+)
Vector operates autonomously. It scores, routes, triggers notifications, and logs every decision. Humans review weekly outcome reports and adjust scoring weights when conversion patterns shift.
Governance: Full governance rails. Audit log for every decision. Rollback to previous scoring model if conversion rates drop.
UX: Weekly outcome report. Exception alerts only. Scoring weight adjustment interface.
[Figure: Shadow to Steer to Self. The migration pattern from full human control to autonomous operation.]
The entire progression took 11 weeks. Each upgrade was gated by both engineering readiness (quality gates passing, error rates acceptable) and user readiness (trust calibrated, monitoring patterns established). Skipping a stage would have either broken the system or broken user trust.
Conversation Flow as Thread Architecture
In Part 3 of this playbook, we defined conversation flow architecture: the structural design of multi-turn interactions between humans and AI agents. We covered state machines, context persistence, handoff topologies, and state recovery.
These conversation flows are structurally identical to thread lifecycles in TBE.
The Structural Parallel
A thread in TBE has a lifecycle: initialization, execution, checkpoints, completion (or handoff). It maintains state across turns. It has defined escalation paths when it encounters something outside its scope.
A conversation flow in AXD has the same lifecycle: opening, multi-turn interaction, decision points, resolution (or escalation). It maintains context across turns. It has defined handoff patterns when the agent needs to transfer to another agent or a human.
The vocabulary differs. The architecture is the same.
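The parallel can be made literal by lining the two lifecycles up phase by phase. A sketch; the phase names paraphrase the two descriptions above, and the pairing is illustrative rather than a formal equivalence.

```python
from enum import Enum

class ThreadPhase(Enum):   # TBE vocabulary
    INIT = "initialization"
    EXECUTE = "execution"
    CHECKPOINT = "checkpoint"
    COMPLETE = "completion or handoff"

class FlowPhase(Enum):     # AXD vocabulary
    OPENING = "opening"
    TURN = "multi-turn interaction"
    DECISION = "decision point"
    RESOLUTION = "resolution or escalation"

# One lifecycle, two vocabularies: members pair up in definition order.
PARALLEL = dict(zip(ThreadPhase, FlowPhase))
```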
Thread Handoffs Are Agent Handoffs
In TBE, when a thread exceeds its autonomy level, it escalates. A P-thread encountering an unfamiliar situation escalates to a B-thread (requiring human approval). A C-thread generating creative output that does not meet quality gates hands off to a human reviewer.
In AXD, when a conversation reaches an escalation point, the agent hands off. An AI agent encountering a question outside its scope transfers to a specialized agent or a human. The hub-and-spoke topology routes conversations through a central orchestrator.
Same mechanism. In TBE, the orchestrator is a governance framework that routes threads based on complexity and risk. In AXD, the orchestrator is a conversation router that routes interactions based on intent and capability. Build them as one system, not two.
Multi-Agent Systems as Multi-Thread Systems
Hive, our multi-agent orchestration platform, demonstrates this unification. In Hive, multiple agents operate simultaneously at different thread levels:
- The monitoring agent runs as a Z-thread (fully autonomous, out-of-the-loop). It checks citations, flags anomalies, and logs activities without human intervention.
- The content agent runs as a C-thread (creative, with review gates). It generates recommendations but surfaces them for human review before execution.
- The coordinator agent runs as a P-thread (parallel, monitored). It routes tasks between agents while a human watches the dashboard.
Each agent has a different thread level, which means each agent needs a different supervision UX. The monitoring dashboard for the Z-thread agent shows outcome summaries. The review interface for the C-thread agent shows drafts for approval. The status panel for the P-thread coordinator shows real-time routing decisions.
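The per-agent assignment described above can be captured as configuration. The agent roles and thread levels come from the text; the structure and names are illustrative, not Hive's actual configuration format.

```python
# Per-agent thread assignments in a Hive-style orchestration.
# Roles and thread levels are from the description above; the config
# shape itself is a hypothetical sketch, not Hive's real format.
HIVE_AGENTS = {
    "monitoring":  {"thread": "Z", "ux": "outcome summaries"},
    "content":     {"thread": "C", "ux": "draft review interface"},
    "coordinator": {"thread": "P", "ux": "real-time routing status panel"},
}

def supervision_ux(agent: str) -> str:
    """Each agent's thread level dictates its supervision surface."""
    return HIVE_AGENTS[agent]["ux"]
```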
Measuring the Unified Stack
Part 5 of this playbook defined AX metrics: trust calibration, intervention rate, autonomy adoption, recovery satisfaction, and collaborative efficiency. These metrics work even better when mapped to thread outcomes.
Engineering Metrics That Feed Experience Metrics
| Engineering Metric (TBE) | Experience Metric (AXD) | What It Tells You |
|---|---|---|
| Thread completion rate | Task success rate | Are agents finishing what they start? |
| Quality gate pass rate | Trust calibration accuracy | Do users trust the agent the right amount? |
| Governance violations | Intervention rate | How often do users need to step in? |
| Autonomy level distribution | Autonomy adoption | Are users upgrading agents to higher thread levels? |
| Thread escalation frequency | Recovery satisfaction | How well does the system handle failures? |
| Thread type mix (B/P/L/F/C/Z) | Supervision pattern distribution | Is the product moving toward appropriate autonomy? |
The unified measurement approach means both teams look at the same data. When the engineering team sees a spike in governance violations, the design team knows to expect a corresponding drop in trust calibration. When the design team sees users refusing to upgrade agents to higher autonomy levels, the engineering team knows the quality gates at the next level need improvement.
The Autonomy Health Metric
One metric captures the health of the entire unified stack: the gap between engineering readiness and user trust readiness for each thread level.
If your engineering governance supports P-thread operation (monitoring hooks, anomaly detection, rollback procedures are all working) but users are stuck at B-thread behavior (approving every action manually), you have a trust design problem. The engineering is ready. The experience is not.
If users are comfortable with P-thread behavior (monitoring dashboards, occasional interventions) but your engineering governance is still at B-thread level (no automated monitoring, no anomaly detection), you have a governance problem. The experience is ready. The engineering is not.
The ideal state: both sides are at the same level, and that level is appropriate for the task's risk profile.
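If you order the thread levels on the ladder, the health metric reduces to a signed gap. A sketch: the ordering follows the worked example's four stages (C- and L-threads would slot in wherever your governance places them), and the function name is invented.

```python
# Ladder order from the worked example: B (Shadow) -> P (Steer) -> F (Guide) -> Z (Self).
LADDER = ["B", "P", "F", "Z"]

def autonomy_gap(engineering_level: str, trust_level: str) -> int:
    """Positive gap: engineering is ahead (a trust design problem).
    Negative gap: users are ahead (a governance problem).
    Zero: both sides are at the same level."""
    return LADDER.index(engineering_level) - LADDER.index(trust_level)
```

A dashboard that plots this gap per interaction shows at a glance which side of the stack needs investment.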
The Full Framework: From Code to Experience
Here is the complete unified framework, from the lowest engineering layer to the highest experience layer:
Layer 1: Thread Governance (Engineering)
Define the thread types your agents operate at. For each type, define quality gates, escalation rules, and rollback procedures. Implement enforcement via hooks and automated governance. This is the foundation everything else builds on.
Layer 2: Context Architecture (Engineering + Design)
Build the context hierarchy that agents use to maintain state across interactions. This includes short-term context (current session), long-term context (user history), and episodic context (specific past interactions). The same context system serves both engineering needs (agent decision-making) and design needs (personalization, conversation continuity).
Layer 3: Trust Design (Design)
Implement trust patterns calibrated to each thread level. B-thread interactions get approval dialogs. Z-thread interactions get outcome reports. Progressive autonomy lets users move interactions up the ladder as trust is established. Surface quality gate results as trust signals.
Layer 4: Conversation Flow (Design + Engineering)
Design conversation architectures that map to thread lifecycles. Multi-turn conversations maintain state like threads maintain context. Handoff patterns match escalation rules. Recovery flows match rollback procedures.
Layer 5: Personality and Voice (Design)
Calibrate agent personality to thread level. A B-thread agent that requires constant approval should have an explanatory, patient tone. A Z-thread agent that operates autonomously should have a concise, results-oriented tone. The personality adapts as the thread level changes.
Layer 6: Metrics (Engineering + Design)
Measure outcomes using AX metrics mapped to thread types. Track the autonomy health metric (engineering readiness vs user trust readiness) to identify where the unified stack has gaps.
Patterns and Anti-Patterns
After building Vector, Hive, and several client agentic products, we have identified recurring patterns that work and anti-patterns that consistently cause failures.
[Figure: Patterns and Anti-Patterns. Named patterns to follow and anti-patterns to avoid.]
The common thread across all anti-patterns: a mismatch between what the engineering layer enforces and what the experience layer communicates. Invisible Autonomy means the agent acts but the user cannot see what it is doing. Governance Theater means the user sees trust signals that the code does not actually enforce. Both destroy trust. The patterns solve this by keeping governance and experience in sync at every thread level.
What This Means for Teams Building Agentic Products
If you are building an agentic product today, you are almost certainly building the engineering and design sides separately. Your engineering team has governance rules. Your design team has interaction patterns. They are probably not connected.
Here is how to start connecting them.
Step 1: Map Your Current State
For every agent interaction in your product, identify the current thread level and supervision pattern. Most teams discover they are building everything as B-threads (full human approval) because they have not defined the governance rules needed to grant higher autonomy.
Step 2: Identify Upgrade Candidates
Look for interactions where users repeatedly approve the same type of action. If a user has approved the agent's email draft 50 times without a single rejection, that interaction is ready to move from B-thread to P-thread. The engineering team adds monitoring. The design team replaces the approval dialog with a dashboard.
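The "repeated approvals, zero rejections" heuristic can be checked directly against the approval log. A sketch: the threshold of 50 echoes the example above, and the log shape (a list of decision strings per interaction type) is an assumption.

```python
def is_upgrade_candidate(decisions: list[str], threshold: int = 50) -> bool:
    """True when the last `threshold` decisions for an interaction type are
    all approvals: no rejections, no overrides. A single rejection in the
    window resets the case for upgrading."""
    recent = decisions[-threshold:]
    return len(recent) >= threshold and all(d == "approve" for d in recent)
```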
Step 3: Build the Governance First
Before upgrading any interaction to a higher thread level, build the engineering governance for that level. P-threads need monitoring hooks and anomaly detection. F-threads need pre-approved scope definitions. Z-threads need comprehensive quality gates and rollback procedures. Do not upgrade the UX before the governance is ready.
Step 4: Surface Quality Gates as Trust Signals
Take the quality gates from Step 3 and expose relevant ones to users. Not all of them. Just the ones that help users understand why the agent is trustworthy at its new autonomy level.
Step 5: Measure Both Sides
Track engineering metrics and experience metrics together. When they diverge (governance says ready, users say not ready, or vice versa), you know exactly where to focus.
If you have been following this playbook series, you now have the complete picture: what AX design is (Part 1), how trust works (Part 2), how conversations flow (Part 3), how personality shapes trust (Part 4), how to measure it (Part 5), and now how it all connects to the engineering layer.
Ready to build agentic products with a unified framework?
- AI Product Development: Ship production-ready agentic products in 90 days
- Explore our products: See Vector and Hive in action
- Contact us: Discuss your agentic product strategy
Thread-Based Agentic Experience Engineering: Questions Readers Ask
Common questions about this topic, answered.
