Heimdal: Call for Immediate Review of AI Safety Standards

Recent findings by Anthropic, an AI safety start-up, have highlighted the risks associated with large language models (LLMs), prompting calls for a swift review of AI safety standards.

Deceptive-LLMs-2023-vs-2024-Anthropic_ARTIKKELIKUVA

The Anthropic team found that LLMs could become "sleeper agents," evading safety measures designed to prevent negative behaviors.

AI systems that act like humans to trick people are a problem for current safety training methods.

Even with thorough safety training, an AI’s hidden “sleeper agent” behavior can be triggered by something as simple as the year changing from 2023 to 2024 (scenario 1). This means the AI, while seeming harmless, could suddenly turn nasty.

Valentin Rusu, lead machine learning engineer at Heimdal Security and holder of a Ph.D. in AI, insists these findings demand immediate attention.

“It undermines the foundation of trust the AI industry is built on and raises questions about the responsibility of AI developers,” said Rusu.

R&D | 26.9.2024

SPX FLOW and Siemens Collaborate on Digital Twin and AI Product Design

SPX FLOW and Siemens collaborate on digital twin and AI product design

R&D | 30.1.2025

Green Transition Affects Materials Selection and Maintenance of Process Equipment

R&D | 16.12.2024

Use of Sensors to Optimize Maintenance and Lifetime

R&D | 16.12.2024

Less than Half of Top 50 Steel Producers Have a Net Zero Target

Less than half of the world’s top steel producers have targets to reach net zero emissions by midcentury, and even fewer track the full scope of emissions produced by their business, jeopardizing the sector’s ability to meet long-term climate aims, finds a new report from Global Energy Monitor and the Leadership Group for Industry Transition, hosted at the Stockholm Environment Institute.

R&D | 31.10.2024

Towards an Energy-Efficient Production Process – Measuring and Evaluating Cost Savings and Environmental Benefits

R&D | 16.10.2024

Research on Fossil-Free Iron and Steel Production on an Industrial Scale

R&D | 28.8.2024

Point of View

16.12.2024

What is Wrong with Maintenance...

Jaakko Tennilä

Editor-in-Chief, Maintworld magazine

16.10.2024

AI and Maintenance

Jaakko Tennilä

Editor-in-Chief, Maintworld magazine

5.6.2024

Asset Management

Jaakko Tennilä

Executive Director, Finnish maintenance society, Promaint
Editor-in-Chief, Maintworld magazine

Browse Magazine

Last issue

Do Lifts Need to Be Protected Against Cyber-Attacks?

Drones Have Received Official Permission to Maintain Wind Turbines

Prioritise People Eliminate the Not-invented-here Syndrome

Use of Sensors to Optimize Maintenance and Lifetime

Global Industrial Maintenance Services: Driving Asset Longevity, Reliability, and Market Growth

Case Study: Using Ultrasound to Detect Bearing Electrical Fluting

Lots of Talk, Little Practice

RCM or Not? Are You Shooting Mosquitos With a Big Canon?

Green Transition Affects Materials Selection and Maintenance of Process Equipment

Thriving in Chaos: How Industry 5.0 Redefines Resilience and Reliability

What is Wrong with Maintenance...

Less than Half of Top 50 Steel Producers Have a Net Zero Target

Call for Immediate Review of AI Safety Standards Following Research on Large Language Models

MRO for Automation Solutions Market Outlook (2024-2028)

Management 'bought the AI hype' and expect value but research shows lack of organizational readiness is primary hurdle

New Findings Could Lead to Safer, More Stable Metal Batteries

Uncharted Territory: Only Accurate Detection Can Swing the Pendulum Away From Dangerous Extremes

Heimdal: Call for Immediate Review of AI Safety Standards

SPX FLOW and Siemens Collaborate on Digital Twin and AI Product Design

Green Transition Affects Materials Selection and Maintenance of Process Equipment

Use of Sensors to Optimize Maintenance and Lifetime

Less than Half of Top 50 Steel Producers Have a Net Zero Target

Towards an Energy-Efficient Production Process – Measuring and Evaluating Cost Savings and Environmental Benefits

Research on Fossil-Free Iron and Steel Production on an Industrial Scale

Point of View

What is Wrong with Maintenance...

AI and Maintenance

Asset Management

MAINTWORLD MAGAZINE

Categories

CONTACT US

ADVERTISING

Order Newsletter

Browse Magazine

Last issue

Heimdal: Call for Immediate Review of AI Safety Standards

Share Article

Point of View

Latest articles

MAINTWORLD MAGAZINE

Categories

CONTACT US

ADVERTISING