
Anthropic enhances Claude 4 with safety precautions to minimize the risk of users creating harmful weapons

AI controls were tightened on Thursday for Claude Opus 4, the latest AI model from Anthropic.

On Thursday, Anthropic opted to enforce stricter controls on its latest AI model, Claude Opus 4. The new controls, designated AI Safety Level 3 (ASL-3), aim to restrict the model's potential misuse in the development or acquisition of chemical, biological, radiological, and nuclear (CBRN) weapons.

According to a blog post published by the company, the enhanced controls were put in place as a precaution: the team has not yet determined whether Opus 4 has crossed the capability threshold that would make such protection necessary. Anthropic noted that the new controls will not apply to Claude Sonnet 4.

Anthropic unveiled both Claude Opus 4 and Claude Sonnet 4 on Thursday, highlighting the models' ability to analyze vast amounts of data, complete long-running tasks, generate human-quality content, and carry out complex actions.

The company is backed by Amazon.

In addition to the ASL-3 controls, Anthropic employs several measures to keep its AI models secure and responsibly used. These include Constitutional Classifiers that block harmful or inappropriate requests, specialized reinforcement-learning training to resist prompt-injection attacks, and ethical guidelines that align model behavior with safety standards. The company also maintains protections against model weights being stolen or exploited. At a high level, safeguards of this kind wrap the model in filters on both sides of a request, as sketched below.
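The following minimal sketch illustrates the general classifier-gating pattern only; it is not Anthropic's implementation, and every name in it (classify_input, classify_output, generate, guarded_generate) is a hypothetical placeholder.

```python
# Illustrative sketch of a classifier-gated request pipeline, loosely modeled
# on the input/output filtering idea behind classifier-based safeguards.
# All functions here are hypothetical stand-ins, not Anthropic's actual API.

def classify_input(prompt: str) -> bool:
    """Return True if the prompt looks like a disallowed request (stub)."""
    blocked_topics = ("synthesize nerve agent", "enrich uranium")
    return any(topic in prompt.lower() for topic in blocked_topics)

def classify_output(text: str) -> bool:
    """Return True if the draft response contains disallowed content (stub)."""
    return "step-by-step synthesis" in text.lower()

def generate(prompt: str) -> str:
    """Stand-in for the underlying language model."""
    return f"Model response to: {prompt}"

def guarded_generate(prompt: str) -> str:
    # Screen the request before it ever reaches the model.
    if classify_input(prompt):
        return "Request declined by input classifier."
    draft = generate(prompt)
    # Screen the model's draft before it reaches the user.
    if classify_output(draft):
        return "Response withheld by output classifier."
    return draft

if __name__ == "__main__":
    print(guarded_generate("Explain how photosynthesis works."))
```

Filtering on both sides means a request can be refused either before it reaches the model or after the model drafts a response, whichever classifier fires first.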

These measures are designed to mitigate the risks associated with powerful AI models, preventing them from being misused or manipulated.

In other news:

  • Microsoft employees reportedly encountered issues sending emails with certain keywords due to internal filters.
  • OpenAI's CFO expressed optimism about the impact of AI hardware on ChatGPT subscriptions in the future.
  • The founders of Amazon's PillPack launched a new health-care marketplace startup, General Medicine.
  • Hinge Health's shares surged by 17% in its NYSE debut.


