PMAI PM Playbook

Human review workflow

Core template

Use this to define when humans review AI output. Fill this out before building any UI. If you skip this, you'll end up with a "just ship it and see" approach to AI autonomy.

Upstream: autonomy levels and human review rules from the AI PRD define what goes here. Downstream: review point failures feed into the PRD risk table, launch gate, and weekly post-launch review.

HITL mode

Pick the review mode per AI action. Different actions in the same product can use different modes.

Mode | What it means | Use when | Example
Human-in-the-loop | Human approves before the AI output or action takes effect | Customer-facing, high-impact, irreversible, or low-confidence work | Agent drafts an email; support rep reviews before sending
Human-on-the-loop | AI acts, humans monitor and can intervene or roll back | Action is reversible, monitored, and low-to-medium risk | AI categorizes tickets; ops reviews alerts and samples
Human-after-the-loop | Humans review samples, incidents, and aggregate trends after the fact | Risk is low, action is reversible, and online metrics are strong | Weekly review of accepted summaries and rejection reasons
No human loop | AI acts without human review | Only for low-risk, reversible, well-tested actions with observability | Internal formatting cleanup or low-stakes enrichment
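
The "Use when" criteria above can be sketched as a small decision function. This is a minimal illustration, not a prescribed rule set: the ActionProfile fields, the 0.8 confidence threshold, and the order in which the modes are checked are all assumptions made for the example, and a real product would set them per action.

```python
from dataclasses import dataclass
from enum import Enum


class ReviewMode(Enum):
    IN_THE_LOOP = "human-in-the-loop"
    ON_THE_LOOP = "human-on-the-loop"
    AFTER_THE_LOOP = "human-after-the-loop"
    NO_LOOP = "no human loop"


@dataclass
class ActionProfile:
    # Hypothetical attributes of one AI action; names are illustrative only.
    customer_facing: bool
    reversible: bool
    risk: str  # "low", "medium", or "high"
    monitored: bool  # alerts or other observability exist for this action
    well_tested: bool
    confidence: float  # 0.0 to 1.0
    online_metrics_strong: bool = False


def pick_mode(p: ActionProfile, min_confidence: float = 0.8) -> ReviewMode:
    """Map an action profile to a review mode (precedence is an assumption)."""
    # Customer-facing, high-impact, irreversible, or low-confidence work
    # gets approval before it takes effect.
    if (p.customer_facing or p.risk == "high" or not p.reversible
            or p.confidence < min_confidence):
        return ReviewMode.IN_THE_LOOP
    # Low-risk, reversible, well-tested, observable actions can run alone.
    if p.risk == "low" and p.well_tested and p.monitored:
        return ReviewMode.NO_LOOP
    # Low risk with strong online metrics: review samples after the fact.
    if p.risk == "low" and p.online_metrics_strong:
        return ReviewMode.AFTER_THE_LOOP
    # Reversible, monitored, low-to-medium risk: act, then monitor.
    if p.monitored:
        return ReviewMode.ON_THE_LOOP
    # Anything that matches no rule falls back to the strictest mode.
    return ReviewMode.IN_THE_LOOP


if __name__ == "__main__":
    ticket_tagging = ActionProfile(
        customer_facing=False, reversible=True, risk="medium",
        monitored=True, well_tested=False, confidence=0.9,
    )
    print(pick_mode(ticket_tagging).value)
```

Falling back to human-in-the-loop when no rule matches keeps unclassified actions under the strictest review until someone fills in the mode map.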

Mode map

AI action | HITL mode | Why this mode | Upgrade or downgrade trigger

Actions AI can take alone

Actions AI can take with rollback

Action | Rollback mechanism | Undo window | Who monitors

Actions AI can suggest only

Action | Who reviews | Max review latency | What reviewer sees

Actions AI must never take

Required review points

Review point | Trigger | Reviewer | What they check | What happens if rejected

Review UI requirements

  • AI output displayed: full output, diff, summary?
  • Source/evidence shown: does the reviewer see what the AI based its answer on?
  • Confidence indicator: is confidence shown, and if so, how?
  • Edit capability: can the reviewer modify the output before approving?
  • Time to review target: how long should a single review take?

Escalation path

  1. Reviewer flags output as uncertain
  2. next step
  3. resolution

Audit trail

  • AI output (full)
  • Reviewer identity
  • Review decision (approve/reject/edit)
  • Edits made by reviewer
  • Timestamp
  • add product-specific fields
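
One way to hold the fields listed above is a single immutable record per review, serialized as one JSON line. This is a sketch under assumptions: the class name ReviewAuditRecord, the field names, the extra dictionary for product-specific fields, and the JSON-lines storage format are all hypothetical choices, not part of the template.

```python
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone
from typing import Optional

ALLOWED_DECISIONS = {"approve", "reject", "edit"}


def _utc_now() -> str:
    # ISO 8601 timestamp in UTC, so records sort and compare consistently.
    return datetime.now(timezone.utc).isoformat()


@dataclass(frozen=True)
class ReviewAuditRecord:
    # Field names are illustrative; map them to your own schema.
    ai_output: str  # full AI output, not a summary
    reviewer_id: str
    decision: str  # "approve", "reject", or "edit"
    reviewer_edits: Optional[str] = None  # edited text or a diff
    timestamp: str = field(default_factory=_utc_now)
    extra: dict = field(default_factory=dict)  # product-specific fields

    def __post_init__(self) -> None:
        if self.decision not in ALLOWED_DECISIONS:
            raise ValueError(f"unknown decision: {self.decision!r}")
        if self.decision == "edit" and not self.reviewer_edits:
            raise ValueError("an 'edit' decision must include the edits")


def to_json_line(record: ReviewAuditRecord) -> str:
    """Serialize one record as a single JSON line for an append-only log."""
    return json.dumps(asdict(record), sort_keys=True)


if __name__ == "__main__":
    record = ReviewAuditRecord(
        ai_output="Draft reply to customer",
        reviewer_id="rep-17",
        decision="edit",
        reviewer_edits="Softened the tone in the second sentence",
    )
    print(to_json_line(record))
```

Freezing the dataclass means a record cannot be changed in place after it is written, which suits an audit trail; corrections would be new records.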

Feedback captured from review

Feedback type | How captured | How used
Rejection reason | e.g., dropdown + free text | e.g., added to eval set
Edit diff | e.g., stored automatically | e.g., fine-tuning data
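
As an illustration of storing the edit diff automatically, the sketch below compares the AI output with the reviewer's approved version using Python's standard difflib module. The function name edit_diff and the "ai_output" / "reviewer_output" labels are assumptions for the example; how the diff is stored and later used is left to the product.

```python
import difflib


def edit_diff(ai_output: str, reviewer_output: str) -> str:
    """Return a unified diff of the reviewer's edits, or "" if unchanged."""
    diff_lines = difflib.unified_diff(
        ai_output.splitlines(),
        reviewer_output.splitlines(),
        fromfile="ai_output",  # hypothetical labels for the two versions
        tofile="reviewer_output",
        lineterm="",  # inputs have no trailing newlines after splitlines()
    )
    return "\n".join(diff_lines)


if __name__ == "__main__":
    original = "Hello,\nWe will refund you."
    edited = "Hello,\nWe will review your refund request."
    print(edit_diff(original, edited))
```

A line-based diff is coarse for short outputs; a word-level or character-level comparison may be a better fit if reviewers mostly make small wording changes.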