Value Test Builder
Step 1: Choose One Workflow That Matters
Focus. The question is not "Where can we use AI?" but "Where does performance matter most?"
Step 2A: Break the Workflow Into Steps
List the process in 3–7 discrete steps.
Step 2B: Test Each Step
For each step: Can AI perform it independently? What would "good enough" reliability look like? What is the actual reliability when tested?
| Step | AI-Tested? (Y/N) | Required Reliability | Observed Reliability | Human Oversight? |
|---|---|---|---|---|
| 1 | | | | |
| 2 | | | | |
| 3 | | | | |
Where reliability is below threshold, redesign or constrain the task — don't scale it yet.
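The threshold check above can be sketched in a few lines of Python. This is a minimal illustration, not part of the worksheet itself: the `StepResult` class, the `steps_below_threshold` function, and all reliability figures are hypothetical, and it assumes reliability is recorded as a fraction between 0 and 1.

```python
from dataclasses import dataclass


@dataclass
class StepResult:
    step: int
    required: float  # required reliability, as a fraction (0.0 to 1.0)
    observed: float  # observed reliability from testing, same scale


def steps_below_threshold(results):
    """Return the step numbers whose observed reliability misses the requirement."""
    return [r.step for r in results if r.observed < r.required]


# Hypothetical figures for illustration only; not taken from any real test.
results = [
    StepResult(step=1, required=0.95, observed=0.97),
    StepResult(step=2, required=0.99, observed=0.91),
    StepResult(step=3, required=0.90, observed=0.90),
]

for step in steps_below_threshold(results):
    print(f"Step {step}: below threshold; redesign or constrain before scaling")
```

A step that exactly meets its required reliability counts as passing here; a stricter rule would be an equally reasonable choice.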
Step 3A: Define the New Human–AI Split
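Based on the test results, decide which steps AI will handle, which steps humans retain, and what triggers escalation.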
Step 3B: Define Measurable Impact
If this redesign works, what improves?
Leadership Commitment
"We will test and redesign ... because improving it will directly impact ...."
Plan Preview
The 3-Step AI Value Test
Caversham House
Step 1: Choose One Workflow
Workflow: ...
Why it matters: ...
Current output: ...
Baseline today: ...
Step 2A: Workflow Steps
1. ...
2. ...
3. ...
4. ...
5. ...
Step 2B: Test Results
| Step | AI-Tested? | Required | Observed | Human Oversight? |
|---|---|---|---|---|
| 1 | ... | ... | ... | ... |
| 2 | ... | ... | ... | ... |
| 3 | ... | ... | ... | ... |
Step 3A: Human–AI Split
AI will handle: ...
Humans retain: ...
Escalation trigger: ...
Step 3B: Measurable Impact
Primary metric: ...
Baseline: ... → 90-day target: ...
Owner: ... | Review date: ...
Leadership Commitment
"We will test and redesign ... because improving it will directly impact ...."