The latest generation of artificial intelligence systems is beginning to shift from responsive tools to semi-autonomous agents, capable of handling complex tasks with minimal oversight. This transition is raising new questions about reliability, control and the boundaries between capability and risk.
Anthropic has released Claude Opus 4.7, the latest version of its large language model, with improvements focused on advanced software engineering, multimodal capabilities and long-running task execution. The model builds on its predecessor, Opus 4.6, and is positioned as part of a broader effort to refine AI systems before deploying more powerful models at scale.
The development comes in the context of growing scrutiny around AI safety, particularly following Anthropic’s own recent work on Project Glasswing, which highlighted both the potential and the risks of increasingly capable models in areas such as cybersecurity.
From assistance to autonomy
One of the defining characteristics of Opus 4.7 is its ability to handle complex, multi-step tasks with greater consistency and precision. Early feedback suggests that users are increasingly able to delegate difficult coding work to the model, including tasks that previously required close human supervision.
This reflects a broader shift in how AI is being used. Rather than acting as a tool that responds to individual prompts, the model is capable of sustaining longer workflows, verifying its own outputs and maintaining coherence across extended interactions. These capabilities are reinforced by improvements in memory, allowing the system to retain and apply information across multiple sessions.
The introduction of more granular control over reasoning effort also highlights the changing nature of AI usage. By allowing users to adjust the balance between speed and depth of reasoning, the model can be tuned for different types of tasks, from quick responses to more complex problem-solving.
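In practice, this kind of tuning amounts to mapping each task type to an effort level before a request is made. The sketch below illustrates the idea only; the level names, the task-to-effort mapping, and the `select_effort` helper are hypothetical and do not reflect Anthropic's actual API:

```python
# Hypothetical sketch: picking a reasoning-effort level per task type.
# EFFORT-related names below are illustrative assumptions, not a real SDK.

TASK_PROFILES = {
    "autocomplete": "low",           # quick responses: minimal reasoning
    "code_review": "medium",         # moderate depth
    "architecture_design": "high",   # complex problem-solving: maximum depth
}

def select_effort(task_type: str, default: str = "medium") -> str:
    """Return the effort level for a task, falling back to a default."""
    return TASK_PROFILES.get(task_type, default)
```

A caller would then pass the returned level alongside the prompt, so that lightweight tasks stay fast while harder ones get deeper reasoning.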
At the same time, enhancements in multimodal capabilities expand the scope of what the model can process. With support for higher-resolution images, Opus 4.7 can interpret detailed visual information, enabling applications that rely on precise visual analysis, such as reading dense screenshots or extracting data from complex diagrams.
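To make the resolution point concrete, a client might downscale an image to fit a model's input budget while preserving aspect ratio before uploading it. The pixel limit below is an illustrative assumption, not a published specification:

```python
# Hypothetical sketch: scale image dimensions down to fit a pixel budget
# while preserving aspect ratio. MAX_PIXELS is an assumed limit for
# illustration, not a documented value.
import math

MAX_PIXELS = 2048 * 2048  # assumed per-image input budget

def fit_dimensions(width: int, height: int) -> tuple[int, int]:
    """Return (width, height) scaled so that width * height <= MAX_PIXELS."""
    pixels = width * height
    if pixels <= MAX_PIXELS:
        return width, height
    scale = math.sqrt(MAX_PIXELS / pixels)
    return max(1, int(width * scale)), max(1, int(height * scale))
```

For example, a 1000×1000 screenshot would pass through untouched, while an 8192×8192 scan would be shrunk to fit the assumed budget before analysis.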
Capability increases bring new constraints
Despite these advances, the release of Opus 4.7 is closely tied to ongoing concerns about safety and misuse. Anthropic has indicated that the model is less capable in certain high-risk areas than its more powerful Mythos Preview system, reflecting a deliberate effort to limit exposure while testing safeguards.
These safeguards include mechanisms to detect and block requests associated with prohibited or high-risk cybersecurity uses. The company has also introduced a Cyber Verification Program to allow legitimate users, such as security professionals, to access the model for approved purposes.
The approach underscores a broader challenge facing the industry. As models become more capable, the risk of misuse increases, particularly in domains where AI can amplify existing vulnerabilities. Balancing accessibility with control is becoming a central issue in how these systems are deployed.
Anthropic’s own assessments suggest that Opus 4.7 maintains a safety profile similar to its predecessor’s, with improvements in areas such as honesty and resistance to prompt-injection attacks, but weaknesses in others. The model is described as largely well-aligned and trustworthy, though not without limitations.
The release of Opus 4.7 illustrates a pattern that is becoming increasingly common in AI development. Progress is no longer defined solely by increases in capability, but by the ability to manage the risks that accompany those gains. As models take on more autonomous roles, the question is not only what they can do, but how reliably they can be constrained.
In that context, the evolution of systems like Opus 4.7 suggests that the future of AI will depend as much on governance and control as on technical performance. The challenge for developers and users alike is to ensure that increasing autonomy does not outpace the mechanisms designed to manage it.