Small is powerful when the enterprise takes control

Share this article

Small language models promise to reshape enterprise AI by delivering tailored accuracy, lower energy costs and tighter governance, but only for those willing to take responsibility for their own data foundation.

For years, large language models have captivated the AI world with their breadth and capability. But the very strengths of these monolithic systems, their scale, generality and expansive training data, have become their core weaknesses in enterprise deployment. Enterprises do not need general intelligence. They need answers, accuracy and trust. That shift in mindset is triggering a broader re-evaluation of what AI infrastructure should look like.

Small language models, or SLMs, are emerging as a serious contender for enterprise-grade AI. Unlike their colossal cousins, these models are trained on focused, domain-specific data, allowing them to deliver more relevant results, operate with fewer resources, and offer greater explainability. They do not aim to be all-knowing. Instead, they are built to know just enough about the right things.

“If you want a chatbot to handle customer service or triage documents in life sciences, you do not need the model to understand nineteenth-century poetry or internet memes,” Sidd Rajagopal, Chief Architect for EMEA and LATAM at Informatica, says. “You want the model to be deeply familiar with your domain and to operate predictably within that context.”

Performance, precision and sustainability

Enterprises drawn to SLMs are doing so not for novelty but necessity. Large models are power-hungry, expensive and increasingly difficult to justify from a sustainability perspective. SLMs, by contrast, offer an attractive trade-off. They are faster to train, cheaper to run and more energy efficient.

But their appeal is not just economic. SLMs, by their specificity, offer a higher degree of control. They can be tuned to prioritise certain risk thresholds, operate within regulatory constraints, and deliver more consistent outputs when narrow accuracy matters. That makes them particularly valuable in sectors like healthcare, finance and engineering, where the cost of error is not reputational, but operational and, in some cases, life-threatening.

“You need to be able to trust what the model is doing,” Rajagopal explains. “SLMs are simpler to audit. Their smaller parameter base and narrow focus make it easier to trace outputs to training data and fine-tune their behaviour.”

This is especially relevant for industries moving towards the edge. Wearables, diagnostic tools and remote monitors increasingly require embedded intelligence. LLMs, with their dependency on cloud infrastructure and high compute, simply cannot operate in those environments. SLMs can.

This edge applicability extends into new domains where latency, privacy and bandwidth limitations preclude reliance on centralised AI systems. Healthtech, automotive diagnostics, field robotics, and smart wearables are only the beginning. The ability to train and deploy models that live and function on-device is a pivotal turning point.

These benefits become even more critical as industries shift to decentralised infrastructure. In remote or regulated environments where data residency and network constraints matter, the ability to run localised AI that respects legal boundaries and performance thresholds is no longer optional. SLMs allow for this operational sovereignty, reducing exposure to compliance risks and minimising dependence on cloud hyperscalers.

Data readiness determines success

Yet none of the benefits of small language models can be realised without the proper data foundation. And this is where many organisations stumble. Models do not compensate for poor inputs. On the contrary, smaller models often magnify the consequences of low-quality or biased data.

“Garbage in, garbage amplified,” Rajagopal warns. “Because these models are specialised, they depend heavily on the integrity of the data they are trained on. If your data is skewed or under-governed, the model will reflect that.”

The implication is stark: SLMs demand more from their users. Unlike public LLMs that rely on a buffet of scraped internet content, SLMs require carefully curated, context-rich, proprietary data. That demands mature data governance, lineage tracking and consistent metadata management, not just for structured information, but for semi-structured and unstructured formats as well.

This includes internal documentation, email communications, customer support transcripts, PDFs, images, and even video and audio content. Without rigorous control, the risk of encoding operational bias or systemic error into the model becomes more than theoretical. It becomes a legal, ethical and financial liability. “You cannot expect consistent outputs from an inconsistent data foundation,” Rajagopal continues. “The responsibility lies with the enterprise to establish and maintain the integrity of its data assets.”

In many cases, the most significant technical barrier to successful SLM deployment is not the model itself, but the fragmented nature of enterprise data ecosystems. Data silos, conflicting schemas, and outdated taxonomies can obstruct AI alignment. Solving this requires not just tooling but executive commitment. Cross-functional teams must be tasked with cleaning, harmonising and governing data before model development even begins.

A governance opportunity, not a burden

One of the persistent misconceptions around AI deployment is that governance is a blocker. In reality, SLMs offer an opportunity to reframe that conversation. Because they are smaller, more transparent and enterprise-controlled, they make governance more tractable. Testing, auditing and refining outputs becomes feasible. So does establishing repeatable processes around bias mitigation, fairness and performance benchmarks.

Rajagopal sees this as an emerging advantage. “If you build the model yourself, using your own data and under your own supervision, then you also own the governance,” he explains. “You are not waiting for a third party to fix hallucinations or explain behaviour. You have control.”

This shift matters for regulatory compliance, but it also shapes organisational culture. Enterprises that take ownership of AI governance are more likely to cultivate internal trust and to accelerate experimentation. The ability to build, test and deploy in shorter cycles, without opaque dependencies, reduces the fear factor that has held many back from using AI more aggressively.

There is a broader consequence here. The process of training an SLM naturally aligns with responsible innovation. Rather than delegating AI oversight to external vendors, the internal team must learn, adapt and engage with ethical and regulatory principles. Transparency becomes baked into the architecture.

Over time, these habits build institutional capability. Engineering teams develop stronger practices for risk evaluation. Legal and compliance functions become active partners in deployment. Most importantly, leadership gains a clearer understanding of what the model is doing and why. That alignment between model behaviour and business value is rarely possible with generic, third-party AI systems.

Legacy infrastructure is not the enemy

Executives often assume that their data systems are too fragmented or outdated to support modern AI initiatives. That assumption is increasingly out of step with technical reality. Legacy systems may be siloed and inflexible, but they often contain the richest historical context available to the business. What matters is how those systems are connected, not whether they are replaced.

APIs, data lakes and vector databases are enabling enterprises to tap into legacy repositories without re-architecting entire operations. Even older enterprise resource planning systems, when properly indexed and surfaced, can power powerful domain-specific models.

“There is a lot of gold hidden in old systems,” Rajagopal says. “You do not need to rip and replace. You need to extract and align.”

This approach extends the value of past investments while setting a trajectory for future interoperability. SLMs do not require perfection. They require relevance.

Companies are already layering modern ML infrastructure over heritage environments to unlock new applications. When governed appropriately, this blend of old and new can yield greater insight than either system could produce alone.

Agentic reasoning and the road ahead

The future of enterprise AI is not one of scale for its own sake. It is one of collaboration, composition and context. Agentic AI systems are now being developed to manage workflows, initiate actions and self-adapt to new inputs. These agents will likely be built upon a mix of model types, each chosen for its suitability to a specific function.

That future puts small language models in a powerful position. Rather than relying on a single dominant model to answer every question, multiple SLMs could be orchestrated in tandem, each executing a defined task, each governed by clear policies, each pulling from validated data sources. “The age of the monolith is ending,” Rajagopal concludes. “What is coming next is a federation of models, each designed to do one thing well, and each made stronger by the enterprise data behind it.”

It is not an easy path. But for enterprises willing to take ownership of their data, build targeted models and invest in responsible governance, it is the most powerful one available.

In time, small models may not just be more efficient, they may also be more intelligent, precisely because they are closer to the domain, closer to the edge and closer to the people they are meant to serve.

Related Posts
Others have also viewed

Episode 29: Chasing efficiency while AI rewrites human judgement

In this episode of Into The Madverse, Mark Venables speaks with David Weinstein, CEO at ...

AI Startups: When AI is forced to confront trust

Artificial intelligence has become more capable, but that has only made its weaknesses more consequential. ...

The inference age will punish narrow networks

Artificial intelligence is shifting from experimentation to continuous operation, and the infrastructure beneath it is ...

Meta turns to custom silicon as agentic AI shifts the balance of compute

Meta has agreed to bring tens of millions of custom processor cores from Amazon Web ...