The Agent Has Shipped. Now Who Is Watching It?

Table of Contents
The Agent Has Shipped. Now Who Is Watching It

There is a version of the agentic AI story that enterprises love to tell. The pilot was a success. The agent is in production. The workflow runs faster, the headcount stays flat, and the board slides look great.

What is rarely in those slides: who approved the decisions the agent made last Tuesday? What happens when it acts on stale data? And if a customer is harmed by something the agent did autonomously, who owns that?

Governance is the part of agentic AI that almost nobody has built yet. Not because it is technically hard, though it is, but because the incentives in most organisations push toward shipping agents and worrying about oversight later. Later, in our experience, tends not to arrive.

The gap between deployment and accountability

The enterprise AI conversation has, for the past two years, been dominated by capability. What can the model do? How fast can we deploy? Can we get from pilot to production in six weeks? These are reasonable questions. They are also incomplete ones.

BCG research found that properly designed agentic systems can accelerate business processes by 30 to 50 percent and cut low-value work by up to 40 percent. Those numbers are real. But BCG also noted, in the same analysis, that governance must be embedded from day one, with each agent assigned a clear owner, defined access limits and explicit ethical boundaries.

In practice, most organisations have done the first part and skipped the second. They have agents running in production environments without documented ownership, without audit trails, and without any clear process for reviewing the decisions those agents are making on behalf of the business.

This is not a criticism of the technology. It is a structural problem. Most enterprise governance frameworks were designed for human decision-makers or deterministic software systems. Neither maps cleanly onto an autonomous agent that reasons, adapts and takes action in real time.

What governance actually means for agentic systems

Governance for AI agents is not the same as AI policy. A policy is a document. Governance is the operational infrastructure that ensures agents behave within intended boundaries, that deviations are caught, and that accountability is clear when something goes wrong.

In practical terms, it means three things working together.

The first is ownership. Every agent needs a named owner inside the business, someone who understands what the agent is authorised to do, what data it can access, and what escalation paths exist when it encounters a situation outside its parameters. Without this, agents become orphaned systems, running reliably until they do not, at which point nobody knows who is responsible.

The second is traceability. Agentic systems must log not just outputs but reasoning steps. In regulated industries this is already a legal requirement. In financial services, for instance, every automated decision touching a customer account needs an audit trail that a compliance team can interrogate. But the principle applies more broadly. If your agent took an action and you cannot explain why, you do not have a trustworthy system, you have an opaque one.

The third is intervention design. The assumption that agents will always perform within expected parameters is incorrect. Models drift. Data changes. Edge cases emerge that nobody anticipated in the design phase. Governance means building the capacity to pause, interrogate, correct and if necessary roll back agent behaviour before small failures become large ones. This is not a limitation on the technology. It is what makes the technology trustworthy enough to give real authority to.

Why most organisations are not ready

The honest answer is that building governance infrastructure is slower and less visible than shipping agents. There is no demo for a well-designed audit trail. Nobody claps for an escalation protocol.

There is also a skills gap that is rarely acknowledged. The people building agents tend to be product and engineering teams. The people who understand compliance, risk and accountability tend to sit in legal, finance or operations. These groups rarely collaborate at the design stage of an AI project. Governance gets handed over after the fact, often to people who did not build the system and do not fully understand it.

This is a version of the wider problem we see in enterprise AI. As we outlined in our AI Productivity Platform report, the organisations that successfully scale AI are not the ones with the most pilots, they are the ones that build the underlying infrastructure, clean data pipelines, unified systems, clear ownership, that allows each new agent to plug into something coherent rather than adding to an already fragmented estate.

Governance is part of that infrastructure. And like data pipelines or security architecture, it is far cheaper to build it in than to retrofit it.

Treating agents like employees, not software

One useful reframe we have found in our work with enterprise clients is to treat AI agents the way you would treat a new member of staff rather than a software deployment.

A new hire gets a job description, a line manager, access credentials scoped to what they actually need, a probationary period and performance monitoring. They are not given the keys to every system on their first day. They escalate decisions they are not authorised to make. They have someone to call when they are unsure.

Agents should work the same way. Define the scope of authority before deployment, not after. Build in human-in-the-loop checkpoints for decisions above a defined risk threshold. Review performance on a regular cadence, not only when something breaks. Assign ownership to a specific team or individual, not to the AI project generally.

The consultancies doing this well, including our own work at Elsewhen, the London-based agentic AI consultancy and digital product studio, are not treating governance as a compliance add-on. They are treating it as a design requirement, a constraint that shapes the architecture of the agent from the outset. That means defining autonomy thresholds, building sandboxed testing environments, and planning the monitoring infrastructure before the first line of production code is written.

The industry is getting louder on this, for good reason

Regulatory pressure is building from multiple directions. The EU AI Act creates tiered obligations for high-risk AI systems, including requirements around human oversight and traceability. In the UK, the government’s AI Safety Institute has made explainability and accountability central to its emerging frameworks. Sector-specific regulators in financial services, healthcare and critical infrastructure are increasingly asking pointed questions about how autonomous systems are monitored and controlled.

Beyond regulation, there is a reputational dimension. Citibank coined the term the DIFM Economy, Do It For Me, to describe where agentic financial services are heading. When agents are doing things for customers without human review, the trust bar is higher, not lower. An agent that makes a wrong call on a customer account does not just create a complaint, it creates a crisis of confidence in the entire system. The UK AI consulting firms currently leading on enterprise agentic deployment are increasingly differentiating themselves not just on capability but on the robustness of the governance frameworks they put around what they build.

And as analysis of the leading AI consulting firms operating in the UK market shows, the firms clients return to are those that build systems with accountability built in from the start, not bolted on when something goes wrong.

Accountability as competitive advantage

There is a temptation to frame governance as the conservative position, the thing risk teams want and product teams have to work around. We think that framing is wrong.

Governance is what allows you to give agents real authority. The more confidence you have in the oversight infrastructure, the more you can trust agents to act autonomously in consequential domains. Low-trust systems get confined to low-stakes tasks. High-trust systems, ones with clear ownership, proper audit trails and well-designed intervention points, earn the right to operate at the centre of the business.

The organisations that will extract the most value from agentic AI over the next five years are not necessarily the ones that ship the fastest. They are the ones that ship with enough structural integrity that the agents they build stay trusted long after the launch announcement has faded.

Getting that right is an architectural question as much as a compliance one. And it is a question worth asking before the agents are already running.

  • Peyman Khosravani is a seasoned expert in blockchain, digital transformation, and emerging technologies, with a strong focus on innovation in finance, business, and marketing. With a robust background in blockchain and decentralized finance (DeFi), Peyman has successfully guided global organizations in refining digital strategies and optimizing data-driven decision-making. His work emphasizes leveraging technology for societal impact, focusing on fairness, justice, and transparency. A passionate advocate for the transformative power of digital tools, Peyman’s expertise spans across helping startups and established businesses navigate digital landscapes, drive growth, and stay ahead of industry trends. His insights into analytics and communication empower companies to effectively connect with customers and harness data to fuel their success in an ever-evolving digital world.

Fill out the form below to request your copy.

Name(Required)