As enterprises push AI from pilots into production, infrastructure is emerging as the defining constraint between ambition and impact. AI-driven automation is no longer an optimisation layer; it is the mechanism through which resilience, scale and competitiveness are now delivered.
The shift hasn’t arrived with a single announcement or sudden failure. It has crept in quietly, exposed only when organisations move AI from experimentation to daily operations. Many enterprises now find that artificial intelligence does not simply sit on top of infrastructure; it reshapes how infrastructure behaves, fails, and must be managed.
For years, AI initiatives lived safely at the edge of IT estates. Pilot projects could tolerate manual intervention, fragmented monitoring, and human-led orchestration. That tolerance is evaporating. As AI systems begin to influence customer interactions, pricing decisions, internal workflows, and digital channels, the cost of delay, outages, or misconfiguration rises sharply.
From Sebastian Weir’s perspective, Executive Partner and AI, Analytics, and Automation Practice Leader at IBM UK & Ireland, this is the moment when infrastructure management becomes a strategic concern rather than a technical one. “What is driving this shift is not curiosity about technology,” he explains. “It is pressure. Productivity pressure, cost pressure, and the opportunity to grow. Organisations see what AI can do at the business level, particularly with agentic AI, and then realise the infrastructure underneath was never designed to operate at that pace or level of autonomy.”
That realisation pushes change down the stack. Automation is no longer about convenience; it has become a prerequisite for scale. Manual orchestration cannot keep up with systems that adapt in real time, spin up workloads dynamically, and interact continuously with users and other systems.
The result is an inflection point: infrastructure teams must now enable an operating model that demands inherent intelligence, responsiveness, and resilience, quickly revealing where traditional approaches fall short.
Where traditional infrastructure management breaks down
Despite the intensity of the conversation around GPUs, clusters and scaling, Weir does not describe the current challenge as a single technical bottleneck. Instead, he sees a pattern of gradual strain caused by a mismatch between how AI behaves and how infrastructure has historically been managed. “The technology itself is moving very fast,” he says. “Eighteen months ago, we were focused heavily on token counts and context windows. A lot of that has already evolved at the application and model layer.”
Friction emerges in behaviour, not capacity. AI-driven systems alter demand patterns, introduce unpredictable peaks and dependencies, and defy static rules or historical averages.
One area Weir highlights is resilience. As digital experiences become more autonomous and more continuous, tolerance for disruption falls. Agentic systems do not pause politely when infrastructure struggles. They expose weaknesses immediately. “As interactions with digital services change, particularly with things like agentic commerce, the robustness of infrastructure becomes critical,” he explains. “We are going to see the impact of that very quickly as demand profiles shift.”
Traditional infrastructure management was built for slow, predictable change, but that no longer suffices. Human-led processes cannot keep up with the speed required when systems must continuously self-adjust and self-heal. Automation is now about keeping pace with AI-driven reality, not just efficiency.
From observability to foresight at scale
Enter observability, or more precisely, its evolution toward proactivity. Most large enterprises already collect massive telemetry. Metrics, logs, and traces stream from applications, infrastructure, and networks. The challenge is interpretation. “When tracking tens or hundreds of thousands of metrics, humans cannot see patterns,” Weir says. “They can respond to symptoms, but struggle to anticipate behaviour.”
AI changes that relationship. By embedding intelligence into observability platforms, systems can correlate signals across layers, identifying trends and anomalies that would otherwise remain invisible until failure occurs. “The shift is from reactive analysis to proactive insight,” he explains. “Instead of asking what went wrong, you start asking what is changing.”
This allows infrastructure teams to anticipate inflection points, model interventions, and address issues before they impact users. Maintenance becomes predictive rather than reactive. Capacity planning becomes adaptive rather than static. It also changes how teams are measured: success becomes how rarely incidents occur, not how quickly they are resolved.
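The shift Weir describes, from responding to symptoms to spotting what is changing, can be illustrated with a deliberately simple sketch. The function below flags metric samples that deviate sharply from their own recent history using a rolling z-score. It is a toy stand-in (not IBM's tooling, and real platforms correlate thousands of metrics across layers), but it shows the basic move from reacting to a failed threshold to detecting a change in behaviour.

```python
from collections import deque
import statistics

def rolling_zscore_alerts(samples, window=30, threshold=3.0):
    """Flag samples that deviate sharply from recent behaviour.

    Returns the indices of samples whose z-score against the trailing
    window exceeds the threshold -- a crude stand-in for the anomaly
    detection an AI-driven observability platform performs across
    many metrics at once.
    """
    history = deque(maxlen=window)  # sliding window of recent values
    alerts = []
    for i, value in enumerate(samples):
        if len(history) >= 2:
            mean = statistics.fmean(history)
            stdev = statistics.pstdev(history)
            if stdev > 0 and abs(value - mean) / stdev > threshold:
                alerts.append(i)
        history.append(value)
    return alerts

# Steady latency readings with one sudden spike: only the spike is flagged.
latencies = [100.0, 102.0, 99.0, 101.0, 100.0] * 6 + [400.0]
print(rolling_zscore_alerts(latencies))  # -> [30]
```

A static threshold would have to be tuned per metric by hand; a behavioural baseline like this adapts to each signal, which is why it scales to estates where, as Weir notes, humans cannot see the patterns themselves.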
It also changes how organisations invest. Insights generated at the infrastructure level can inform decisions about modernisation, optimisation and prioritisation. Rather than guessing where technical debt is most painful, leaders can see where behaviour deviates and risk accumulates. For Weir, this is where AI earns its place operationally. “It is about bringing abstraction and understanding,” he says. “Not just more data but a better understanding of what that data means in context.”
Control planes, not tool sprawl
Most enterprises do not arrive at this stage with a clean slate. Years of incremental change have left them with sprawling estates, fragmented monitoring tools and siloed teams. Each new platform was added to solve a local problem, rarely with a view to the whole. “The reality for most organisations is complexity,” Weir acknowledges. “Hybrid cloud, on-premise systems, SaaS platforms, and often highly sensitive environments as well.”
The instinct is to add another tool, but Weir says this deepens the problem. What’s needed is not more monitoring, but a unified control plane to abstract complexity. “You need consistent observability and management anywhere,” he explains. “On-premise, in cloud, across hyperscalers. Consistency enables confident decisions.”
This abstraction enables a more mature operating model. Central teams retain the visibility required for governance, security and risk management. At the same time, delivery teams are not constrained by rigid central control. “It becomes a federated model,” Weir says. “The centre monitors and sets guardrails. The edge moves at pace. Risk acceptance sits where it should.”
This balance is essential as AI spreads. Without it, organisations swing between disorder and over-control. Only a unified approach allows automation to scale while maintaining accountability, making it a test of organisational readiness as much as of technology.
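The federated model Weir outlines, where the centre sets guardrails and keeps visibility while edge teams move at pace, can be sketched in a few lines. The class names, limits, and region values below are all hypothetical, chosen purely for illustration; the point is the shape of the contract: central limits, autonomous edge requests, and an audit trail for governance.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Guardrails:
    """Limits defined centrally; delivery teams act freely within them."""
    max_replicas: int = 10
    allowed_regions: frozenset = frozenset({"eu-west", "us-east"})

class ControlPlane:
    """Illustrative federated control plane: the centre defines
    guardrails and records every request, while edge teams scale
    workloads autonomously as long as they stay inside the limits."""

    def __init__(self, guardrails: Guardrails):
        self.guardrails = guardrails
        self.audit_log = []  # central visibility over edge decisions

    def request_scale(self, team, workload, replicas, region):
        within = (replicas <= self.guardrails.max_replicas
                  and region in self.guardrails.allowed_regions)
        self.audit_log.append((team, workload, replicas, region, within))
        return within

plane = ControlPlane(Guardrails())
print(plane.request_scale("payments", "fraud-model", 4, "eu-west"))   # True
print(plane.request_scale("payments", "fraud-model", 50, "eu-west"))  # False
print(len(plane.audit_log))  # 2: every request is visible centrally
```

Note that a denied request is still logged: the centre monitors everything, but risk acceptance sits at the edge, which is the balance the federated model is meant to strike.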
Infrastructure as an advantage and the human role within it
Perhaps the most significant shift is conceptual. Infrastructure is no longer viewed simply as a cost centre. In an AI-driven organisation, it becomes a source of advantage. “What we are seeing is infrastructure moving from being a burden to being a differentiator,” Weir explains. “The ability to run AI workloads flexibly, reliably and efficiently is now part of how organisations compete.”
This does not mean automation replaces people. Weir is explicit about that. IBM frames its approach as augmented intelligence rather than full autonomy. “AI creates insight,” he says. “Humans make decisions.”
This distinction matters. Fully autonomous systems promise efficiency but risk eroding trust. Augmented systems aim to amplify expertise while preserving accountability. Engineers are not removed from the loop; they are given better information, earlier.
The role of the infrastructure engineer evolves accordingly. Instead of reacting to alerts and outages, they focus on understanding patterns, anticipating change and guiding systems toward stability. “It is a more proactive role,” Weir says. “More informed, more assisted.”
That shift demands cultural change. AI literacy becomes as important as technical skills. Teams must learn to trust insights generated by machines while remaining able to challenge them. Transparency is essential. “There cannot be black boxes,” Weir argues. “People need to understand how decisions are made, what risks are involved, and why actions are recommended.”
Most organisations will not rebuild from scratch. AI-driven automation must coexist with legacy systems. The value lies in extracting insight from what already exists to guide incremental change.
Where organisations struggle, it is rarely because the technology is insufficient. “It is cultural,” Weir concludes. “Preparing people to work alongside AI is the harder part. Leaders need to focus on that as much as they focus on platforms.”
As AI becomes embedded in the fabric of enterprise operations, infrastructure automation ceases to be an IT initiative. It becomes a test of organisational maturity. The systems may already be capable of adapting. Whether organisations are ready to let them do so remains an open question.