The AI race is shifting from training models to running them

Mark Venables

The Week in AI, AI Hardware/Infrastructure

Share this article

The next phase of artificial intelligence may be defined less by building larger models and more by finding efficient ways to deploy them at scale.

That was one of the central messages emerging from Intel’s latest AI announcements at Computex 2026, where the company unveiled a series of infrastructure, processor and industry partnerships aimed at addressing what it sees as the growing importance of AI inference and agentic AI systems.

The announcements reflect a broader shift taking place across the AI industry. After years of investment focused primarily on training increasingly sophisticated foundation models, attention is moving towards the practical challenge of running those models in production environments. As businesses seek to deploy AI applications across operations, customer services and industrial processes, demand for inference infrastructure is rising rapidly.

Intel argues that this transition is reshaping the architecture of modern data centres and creating new opportunities for processors, networking and system-level infrastructure.

Agentic AI changes the economics

The emergence of agentic AI is altering assumptions that have underpinned AI infrastructure planning over the past several years.

Training large language models has traditionally relied on large clusters of graphics processors, with CPUs playing a supporting role. However, Intel believes the rise of inference and autonomous AI agents is changing that balance.

According to analysis cited by the company, AI deployments during the model training era typically operated at a ratio of one CPU for every four GPUs. As AI workloads become increasingly inference-driven, that relationship is moving closer to parity, increasing the importance of processor performance, orchestration and data movement across AI systems.

To address that shift, Intel announced a rackscale AI infrastructure initiative with Foxconn and SambaNova. The companies intend to develop infrastructure designed for hyperscale data centres and AI deployments based on Intel Xeon processors and SambaNova’s reconfigurable dataflow units.

The move highlights a growing industry focus on power efficiency and operational economics. As AI workloads scale, organisations are becoming increasingly concerned not only with performance, but also with the cost and energy required to deliver inference at commercial scale.

Infrastructure becomes the competitive advantage

One of the more notable aspects of Intel’s announcements is the emphasis on complete infrastructure stacks rather than individual components.

The company unveiled a series of partnerships spanning data centre infrastructure, cloud services, manufacturing, healthcare and life sciences. Rather than positioning AI purely as a computing challenge, Intel is focusing on industry-specific solutions that combine processors, specialised silicon and software tailored to particular sectors.

Among the partnerships announced were expanded collaborations with Foxconn, Siemens, Hitachi, Echo Neurotechnologies and Greenstone Biosciences.

The Siemens collaboration reflects growing interest in applying AI across industrial environments, manufacturing systems and robotics. Intel said the companies will explore purpose-built silicon for a range of computing requirements spanning edge devices, high-performance computing and robotic systems.

The announcements suggest that AI adoption is entering a phase where industry expertise and domain-specific infrastructure may become as important as model performance itself.

The rise of the inference economy

Perhaps the clearest indication of where the market is heading came from the launch of a new enterprise inference cloud platform.

Vector Core Compute, a company formed by Vista Equity Partners and Cambium Capital, unveiled what was described as a fully disaggregated inference architecture combining Intel Xeon processors, SambaNova RDUs and NVIDIA Blackwell GPUs.

The architecture reflects a growing belief that AI workloads can be broken into specialised tasks executed across different types of hardware, potentially improving efficiency and reducing costs. Such approaches are attracting increasing attention as organisations seek to balance performance requirements with growing concerns about energy consumption and infrastructure expenditure.

Intel also announced the availability of its next-generation Xeon 6+ processors, designed for cloud-native and agentic AI workloads. Built using Intel’s 18A process technology, the processors are intended to support the orchestration and concurrency demands associated with increasingly autonomous AI systems.

Taken together, the announcements point to a significant evolution in the AI market. The industry’s focus is gradually shifting from the race to build larger models towards the challenge of operating AI at scale. As agentic systems become more common and inference workloads continue to grow, the winners may not be determined solely by who develops the most advanced models, but by who can deliver them most efficiently, reliably and economically.

For the AI sector, that could mean the next major battleground is no longer model development, but the infrastructure required to make artificial intelligence useful in the real world.