← Back to Blog

Agent Governance Before Agent Scale

Scott Felten 3 min read The Platform Shift
Agent work lane supervised through tools, review, and handoff Flat IAG agent supervision lane diagram for Agent Governance Before Agent Scale. workqueue agent tools supervisor handoff

Article graphic

Agent work lane supervised through tools, review, and handoff

Flat IAG agent supervision lane diagram for Agent Governance Before Agent Scale.

agent-supervision-lane

The period around January 2026 gave leadership teams another clear signal that generative AI was becoming an operating issue, not a side experiment. NIST/CAISI requested input on securing AI agent systems while Microsoft introduced Maia 200 inference silicon. The combined hook is that agent scale depends on security discipline and economic feasibility.

For executives, the important question was not whether the technology looked impressive. The useful question was how the new capability would enter real work, which controls would need to be present, and what evidence would show that the organization was getting durable value rather than temporary attention.

The Operating Signal

Teams are piloting agents before they have clear security models, action boundaries, monitoring practices, or cost models for scaled inference.

That problem is familiar from every major technology cycle. The internet, ecommerce, telecom, mobile, cloud, and social media all created value only after organizations built the operating muscle around them: governance, architecture, adoption, measurement, vendor management, security, and clear accountability. AI follows the same pattern. It may move faster, but it does not remove the need for management discipline.

Operating implication: Agents need controls for both action risk and inference economics. Once systems can use tools, trigger workflows, and affect business records, governance must cover what agents can do and what each delegated action costs.

What Leaders Should Manage

The first management move is to separate a capability from an operating model. A model release, vendor announcement, benchmark, or platform feature can create opportunity. It does not, by itself, define the workflow, the owner, the data boundary, the review step, or the success metric. Those choices still belong to the enterprise.

Practical Frame

For this topic, the practical leadership frame is:

  • Agent risk: tool access changes the governance question from "what did it say?" to "what can it do?"
  • Cost per action: inference economics determine which agent workflows can scale responsibly.
  • Controls: define permissions, approval thresholds, logging, exception handling, and rollback paths.
  • Readiness test: decide which pilots are safe to expand and which need a control layer first.

This keeps the conversation grounded. Instead of asking a team to "use AI," leaders can ask which part of the work is being changed, what information the system is allowed to use, who reviews the output, and how the result will be measured. That is where the value conversation becomes specific enough to manage.

The Review Standard

AI work needs a review standard before it needs a larger rollout. The standard does not have to be heavy, but it should be explicit. A useful review asks whether the workflow is bounded, whether the data is appropriate, whether the output can be checked, whether exceptions have a path, and whether an accountable person owns the decision.

Leadership question: What actions can our agents take today, and where would we see the evidence if one of those actions created risk, cost, or customer impact?

That question should be answered before scale. If the answer is unclear, the organization may still be ready for exploration, but it is not ready to treat the workflow as production capability.

A Practical Starting Point

First Move

Create an agent action register for active pilots: tools accessed, data touched, approval required, logs available, cost driver, business owner, and stop condition.

The output of that step should be a small operating artifact: a workflow map, a use-case brief, a control checklist, a vendor-review note, or a decision record. The artifact matters because it gives leaders something to inspect. It also gives cross-functional teams a shared language for what is being tested and what is not yet approved.

What This Means For IAG Work

IAG's advisory posture for this article is deliberately practical. Position IAG as a partner for designing the agent governance layer before pilots become production dependencies. The goal is not to slow useful adoption. It is to make adoption legible enough that leaders can fund, govern, and scale it with confidence.

The broader theme is steady: AI value is realized through disciplined work design. Better models help. Stronger platforms help. Regulation and standards help. But the enterprise still has to decide which workflows matter, where trust is earned, and how the organization will know when AI assistance is producing reliable business results.

Source Note

The 2 sources linked below ground the timing and context for this article. They should be treated as source material for leadership interpretation, not as proof that any single vendor path or policy response is the right answer for every organization.

Mentioned Concepts

  • AI operating modelThe repeatable management system for selecting AI use cases, assigning owners, governing risk, evaluating outputs, and moving work from experiment to production.
  • risk tieringA way to classify AI use cases by business impact, data sensitivity, regulatory exposure, user population, and required oversight.
  • human reviewA deliberate point where an accountable person checks context, risk, quality, or next action before AI-assisted work is accepted or acted on.
  • workflow governanceThe practice of governing AI at the level of real work: inputs, tools, decisions, owners, metrics, exceptions, and review loops.
  • model portfolioA managed set of models and providers selected for different cost, risk, capability, privacy, and workflow requirements.

Sources and Further Reading