Teams improve AI outcomes faster when they know where they are and what the next level requires—not when they buy another copilot.
The five levels
Level 1 — Ad hoc
Individuals use chat tools without shared standards. Success is personal.
Self-check: No shared library; leadership hears anecdotes, not metrics.
90-day move: Work through "10 Signs Your Company Is Vibe Prompting"; pick one pilot workflow.
Level 2 — Repeatable pilots
One or two workflows have templates and informal review.
Self-check: Pilots work in one team; they break when staff rotate.
90-day move: Document the context spec and eval set for the pilot; assign owners.
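To make "context spec" and "eval set" concrete, here is a minimal sketch in Python. The class names, fields, and the substring-based pass rule are all assumptions chosen for illustration; a real pilot would likely need richer checks. The point is only that both artifacts can live in a file with a named owner rather than in one person's head.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class ContextSpec:
    """Hypothetical record of what a workflow needs before the model is called."""
    workflow: str
    owner: str
    required_inputs: tuple[str, ...]


@dataclass(frozen=True)
class EvalCase:
    """Hypothetical eval case: inputs plus phrases the output must include."""
    case_id: str
    inputs: dict
    must_contain: tuple[str, ...]


def missing_inputs(spec: ContextSpec, inputs: dict) -> list[str]:
    """Return the required input names that are absent from `inputs`."""
    return [name for name in spec.required_inputs if name not in inputs]


def passes(case: EvalCase, output: str) -> bool:
    """Assumed pass rule: every expected phrase appears, ignoring case."""
    lowered = output.lower()
    return all(phrase.lower() in lowered for phrase in case.must_contain)
```

With these in version control, a new team member can run the eval set against the template on day one, which is what keeps the pilot from breaking when staff rotate.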
Level 3 — Operational workflows
Workflows have versioned prompts, integrations, and defined handoffs.
Self-check: You can replay a case from logs; changes go through a named approver.
90-day move: Add evaluation hooks and audit trails.
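The Level 3 self-check (replay a case from logs, changes go through a named approver) can be sketched as follows. This is an illustration under assumptions, not a prescribed design: `PromptRegistry`, `log_case`, and `replay` are hypothetical names, the version is simply a truncated SHA-256 of the template text, and `model` stands in for whatever callable wraps your model.

```python
import hashlib
import json


def prompt_version(template: str) -> str:
    """Derive a short, content-based version id from the template text."""
    return hashlib.sha256(template.encode("utf-8")).hexdigest()[:12]


class PromptRegistry:
    """Hypothetical in-memory store of approved prompt templates."""

    def __init__(self):
        self._templates = {}

    def register(self, template: str, approver: str) -> str:
        # Every stored version records who approved it.
        version = prompt_version(template)
        self._templates[version] = {"template": template, "approver": approver}
        return version

    def get(self, version: str) -> str:
        return self._templates[version]["template"]


def log_case(log: list, case_id: str, version: str, inputs: dict, output: str) -> None:
    """Append one audit record as a JSON line."""
    record = {"case_id": case_id, "prompt_version": version,
              "inputs": inputs, "output": output}
    log.append(json.dumps(record, sort_keys=True))


def replay(log_line: str, registry: PromptRegistry, model):
    """Rebuild the prompt from a log record and rerun it.

    Returns (new_output, logged_output) so the caller can compare them.
    """
    record = json.loads(log_line)
    template = registry.get(record["prompt_version"])
    prompt = template.format(**record["inputs"])
    return model(prompt), record["output"]
```

Because the log stores the prompt version rather than the prompt text, editing a template produces a new version id and old cases still replay against the template that was live at the time.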
Level 4 — Governed scale
Policy, data boundaries, and risk review apply across workflows.
Self-check: Governance roles are staffed; incidents trigger process updates.
90-day move: Set a quarterly risk review cadence; reduce tool sprawl per "Your Company Does Not Need More AI Tools".
Level 5 — Continuous improvement
Metrics drive prompt, context, and model changes; regression tests block bad deploys.
Self-check: Business outcomes tie to workflow KPIs; model swaps are routine with eval gates.
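An eval gate of the kind Level 5 describes can be quite small. The sketch below assumes each eval run is a mapping from case id to pass/fail, and that a deploy is blocked when the pass rate drops by more than a tolerance; `eval_gate`, `GateResult`, and the 0.02 default for `max_drop` are illustrative choices, not a standard.

```python
from dataclasses import dataclass


@dataclass
class GateResult:
    allowed: bool
    baseline_rate: float
    candidate_rate: float
    regressions: list  # case ids that passed before and fail now


def eval_gate(baseline: dict, candidate: dict, max_drop: float = 0.02) -> GateResult:
    """Compare two eval runs over the same cases and decide whether to deploy.

    `baseline` and `candidate` map case id -> bool (True means the case passed).
    """
    if not baseline:
        raise ValueError("eval set is empty")
    if baseline.keys() != candidate.keys():
        raise ValueError("runs must cover the same eval cases")

    baseline_rate = sum(baseline.values()) / len(baseline)
    candidate_rate = sum(candidate.values()) / len(candidate)
    regressions = sorted(k for k in baseline if baseline[k] and not candidate[k])

    # Block the deploy when the pass rate falls by more than the tolerance.
    allowed = candidate_rate >= baseline_rate - max_drop
    return GateResult(allowed, baseline_rate, candidate_rate, regressions)
```

A CI job could call `eval_gate` after a prompt, context, or model change and fail the build when `allowed` is false, with `regressions` telling the owner which cases to look at first.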